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Abstract 

We consider two parametrized random digraph families, namely, proportional-edge and central similarity 
proximity catch digraphs (PCDs) and compare the performance of these two PCD families in testing 
spatial point patterns. These PCD families are based on relative positions of data points from two classes 
and the relative density of the PCDs is used as a statistic for testing segregation and association against 
complete spatial randomness. When scaled properly, the relative density of a PCD is a [/-statistic. We 
extend the distribution of the relative density of central similarity PCDs for expansion parameter being 
larger than one. We compare the asymptotic distribution of the statistic for the two PCD families, using 
the standard central limit theory of [/-statistics. We compare finite sample performance of the tests by 
Monte Carlo simulations and prove the consistency of the tests under the alternatives. The asymptotic 
performance of the tests under the alternatives is assessed by Pitman's asymptotic efficiency. We find the 
optimal expansion parameters of the PCDs for testing each of the segregation and association alternatives 
in finite samples and in the limit. We demonstrate that in terms of empirical power (i.e., for finite samples) 
relative density of central similarity PCD has better performance (which occurs for expansion parameter 
values larger than one) under segregation alternative, while relative density of proportional-edge PCD 
has better performance under association alternative. The methods are illustrated in a real-life example 
from plant ecology. 

Keywords: association, complete spatial randomness, consistency, Dclaunay triangulation, Pitman asymp- 
totic efficiency, random proximity graphs, relative density, segregation, 



1 Introduction 



Spatial clustering has received considerable attention in the statistical literature. In recent years, a new 
clustering approach has been developed which uses data-random proximity catch digraphs (PCDs) and is 
based on the relative positions of the data points from various classes. A catch digraph is a directed graph 
whose vertices are pointed sets (a pointed set is a pair (S,p) where S is a set and p a disti nguished poin t ) with 
an arc from vertex (S u ,p u ) to vertex (S v ,p v ) whenever p v £ S u . Hence S u catches p v . iPriebe et al.l (|200l[ ) 
introduced the class cover catch digraphs (CCCDs) and gave the exact and the asymptotic distribution of 
the domination number of the CCCD in R. For two classes, X and y, of points, let X be the class of interest 
and y be the reference class and X n and y m be samples of size n and m from classes X and y, respectively. 
In the CCCD approach the points correspond to observations from class X and the sets are defined to be 
(open) balls centered at the points with maximal radius (relative to the other class y): S x = B(x,r(x)), 
where r{x) = d(x,y m ) is the minimum distance between the observation x G X and the observa t ions o f 
the other class, y m . The C C CD approach is exte nded to multiple d i mensio ns by iDeVinnev et al. (2002), 
Marchette and Priebd (|2003t ). IPriebe et al.l (|2003ah , and lPriebe et al.l (|2003bh . who demonstrated relatively 
good performance of it in classification by employing data reduction (condensing) based on approximate 
minimum dominating sets as prototype sets (since finding the exact minimum dominating set is an NP-hard 
problem — in particular for CCCDs). 
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Cevhanl (|2005l) generalized CCCDs to PCDs. In the PCD approach the points correspond to observations 



from class X and the sets are defined to be (closed) regions (usually convex regions or simply triangles) 
based on class X and y points and the regions increase as the distance of a class X point from class y 
points increases. The (non-parametrized) central similarity proximity map and parameterized proporti onal- 
edge proximity maps and t he associated random PCDs are introduced in ICevhan and Priebel ( 2003a ) and 
Cevhan and Priebel (|2003bl) . respectively. In both cases, the space is partitioned by the Delaunay tessellation 
of class y points which is the Delaunay triangulation in M 2 . In each triangle, a family of PCDs is constructed 
based on the relative positions of the X points with respect to each other and to y points. These proximity 
maps have the advantage that the calculations yielding the asymptotic distribution of the relative density 
are analytically tractable. 

(12000^ 



Recently, the use of mathematical graphs has gained popularity in spatial analysis (jRoberts et al 



providing a way to move beyond the usual Euclidean metrics for spatial analysis. Graph theory is well suited 
to ecological applicati ons concerned with conn ectivity or movement, although it is only recently introduced 
to landscape ecology ( Minor and Urban ( 20071 ) V Conventional graphs re duce the ut i lity of other geo-spatial 
information, because they do not explicitly maintain geographic reference. Fall et al] ( 20071) introduce spatial 
graphs that preserve the relevant spatial information by integrating a geometric reference system that ties 
patches and paths to specific spatial locations and spatial dim ensions. Howev er, usually the scale is lost 
after a graph is constructed using spatial data (see for instance. ISu et al.l (|2007l) ). Many concepts in spatial 
ecology depend on the idea of spatial adjacency which requires information on the close vicinity of an object. 
Graph theory conveniently can be adapted to express and communicate adjacency information allowing one 
to compute meaningful quantities related to a spatial point patte r n. Ad di ng vertex and edge prop erties to 
graphs extends the problem domain to network modeling ( Keittl ( 20071 )). IWu and Murray ( 2008 ) propose 
a new measure based on spatial interaction and graph theory, which reflects intra-patch an d inter-patch 
relationships by quantifying contiguity within and among patches. iFriedman and Rafskv ( 1983 ) also propose 
a graph-theoretic method to measure multivariate association, but their method is not designed to analyze 
spatial interaction between two or more classes; instead it is an extension of generalized correlation coefficient 
(such as Spearman's p or Kendall's r) to measure multivariate (possibly nonlinear) correlation. 

Intuitively, relative density should be useful for testing association or segregation. Under association, the 
observations from one class tend to cluster around those of the other, while under segregation they tend to 
avoid obser vations f rom th e other class. For exam ple, the pattern of spatial se gregation has been investigated 
for species (Digglel (120031) 1. age classes of plants ( Hamill and Wright ( 19861 ) ) and sexes of dioecious plants 
( Nanami et al.l ( 1999I )). Under association, the defining proximity regions tend to be small, and hence there 
should be fewer arcs; while under segregation, the proximity regions tend to be larger and cover many points, 
resulting in many arcs. Thus, the relative density (number of arcs divided by the total number of possible 
arcs) is a reasonable statistic to employ in this problem. Unfortunately, in the case of the CCCD, it is 
difficult to make precise calculations in multiple dimensions due to the geometry of the neighborhoods. The 
do mination number of the pr oportional-edge PCD with r = 3/2 is u sed for testing segregation or association 



Cevhan and Priebe ( 2005 ) and with general r in Cevhan ( 2010bl ) . 



This is appropriate when both classes are comparably large. Cevhan et al. ( 20061 ) used the relative density 
of the same proximity digraph for the same purpose which is appropriate when only size of one of the classes 
is large. The parameters of the PCDs expand the associated proximity region as a function of the distance 
from the point defining the proximity region to the vertices or edges of the triangles in which the point lies. 

In this article, we compare the two parameterized PCD families, namely proportional-edge and central 
similarity PCDs in testing bivariate spatial patterns. The graph invariant we use as a statistic is the relative 
density. We also extend the (expansion) paramet er of central similari t y PCD for values larger tha n one; 
previously it was defined on for the range of (0,1] (ICevhan and Priebel (|2003al ): ICevhan et all (l2007l) V We 
compare the finite sample performance of the relative density of these two PCD families by empirical size and 
power analysis based on extensive Monte Carlo simulations. We also compare the asymptotic distributions 
and asymptotic power performance of the tests under the alternatives. We first consider the case of one 
triangle, followed by the case of multiple triangles (based on the Delaunay triangulation of four or more y 
points). We also propose a correction term for the proportion of X points that lies outside the convex hull 
of y points. 

In Section [21 we provide a general definition of the proximity maps and the associated PCDs and their 
relative density, describe the two particular PCD families (namely, proportional-edge and central similarity 
PCDs). We provide the asymptotic distribution of relative density of the PCDs for uniform data in one 



and multiple triangles in Section [31 describe the alternative patterns of segregation and association, provide 
the asymptotic normality under the alternatives, present the standardized versions of the test statistics, and 
prove their consistency in Section 2] We present the empirical size performance of the PCDs in Section 
and empirical power analysis under the alternatives in Section [5] by extensive Monte Carlo simulations. 
The asymptotic performance of the tests is assessed by comparison of Pitman asymptotic efficiency scores 
in Section [7J We propose a correction method for the X points outside the convex hull of y m in Section [51 
illustrate the use of the tests in an ecological data set in Section [9] We present discussion and conclusions 
in Section 1101 Derivations of some of the quantities and lengthy expressions are deferred to the Appendix 
Sections. 



2 Proximity Maps and the Associated PCDs 

Our PCDs are based on the proximity maps which are defined in a fairly general setting. Let be 
a measurable space and consider a function N : x p(Sl) — > p(f2), where p(-) represents the power set 
function. Then given y m C f2, the proximity map N(-) = N(-,y m ) : f2 — > p(il) associates a proximity region 
N(x) C f2 with each point x £ fl. The region N(x) is defined in terms of the distance between x and y m . If 
X n := {Xi, X 2 , . . . , X n } is a set of ^-valued random variables, then the N(Xi), i = 1, 2, . . . ,n, are random 
sets. If the Xi are independent and identically distributed (iid), then so are the random sets N(Xi). 

Define the data-random PCD, D, with vertex set V = {Xi,X 2 , ■ ■ ■ 7 X n } and arc set A by (Xi,Xj) e 
A Xj € N(Xi). The random digraph D depends on the (joint) distribution of the Xi and on the map 

N. The adjective proximity — for the catch digraph D and for the map TV — comes from thinking of the 
region N(x) as representing tho se p oints in fl "close" to x. An exten sive treatment of the proximity graphs 
is presented in iToussaTn t (1980) and Jaromczvk and Toussaind ( 1992 ). 

The relative density of a digraph D = (V,A) of order |V| = n, denoted p(D), is defined as 



P(D) 



n(n — 1) 

where | • | stands for set cardinality ( Janson et al. ( 2000l )). Thus p(D) represents the ratio of the number of 
arcs in the digraph D to the number of arcs in the complete symmetric digraph of order n, which is n(n — 1). 

If Xi,X2, . . • , X n ~ F, then the relative density of the associated data-random PCD, denoted p(X n \ h, N), 
is a U- statistic, 



p{X n ;h,N) 



(1) 



where 



h(Xi, Xj ; TV) = !{{X h X,) 6 A) + I{(A„ X t ) G A] 
= \{X 3 e/V(A 4 )} + I{A 4 e/V(A,)}. 



(2) 



We denote h(Xi, Xj; N) as /i^ for brevity of notation. Since the digraph is asymmetric, is defined as the 
number of arcs in D between vertices Xi and Xj , in order to produce a symmetric kernel with finite variance 
(jLehmannl (jl988l) ). 



The random variable p n := p(X n ; h, N) depends on n and N explicitly and on F implicitly. The expec- 
tation E [/?„], however, is independent of n and depends on only F and N: 



< E[p n ] = -E[h 12 ] < 1 for all n > 2. 



The variance Var[p„] simplifies to 
< Var[p n ] 



-Var[/n 



n- 2 



2n(n — 1) n(n — 1) 

A central limit theorem for ^/-statistics ( Lehmann ( 19881) ) yields 



Cov[h 12 , h 13 ] < 1/4. 



(3) 



(4) 



\fn{p n - E[p n 



Af(Q,Cov[h 12 ,h 



13 



(5) 



provided Cov[/ii 2 , /113] > where Af(p,,(J 2 ) stands for the normal distribution with mean /i and variance 
o~ . The asymptotic variance of p n , Cav[h\2, his], depends on only F and N. Thus, we need determine only 
E[/ii2] and Cov[/ii 2 , /113] in order to obtain the normal approximation 

approx r , . . Tr . /E[/t,i 2 ] Cov [/li 2 , /113] \ . . 

p„ - (E[p n ],Var[p n ]) = Af L J , for large n. (6) 



2.1 The Proximity Map Families 

We now briefly define two proximity map families. Let SI = M. d and let y m = {yi,y2, ■ • ■ , y m } be m points in 
general position in M. d and T\ be the i th Delaunay cell for i = 1, 2, . . . , J m , where J m is the number of Delaunay 
cells. Let X n be a set of iid random variables from distribution F in R d with support S(F) C C^r^m) where 
CffCVm) stands for the convex hull of 3m- In particular, for illustrative purposes, we focus on R 2 where a 
Delaunay tessellation is a triangulation, provided that no more than three points in 3m are cocircular (i.e., 
lie on the same circle). Furthermore, for simplicity, let 3^3 = {yi;y2,y3} be three non-collinear points in M 2 
and T(3^3) = T(yi,y 2 ,y3) be the triangle with vertices y 3 . Let X n be a set of iid random variables from 
F with support S(F) C T(y 3 ). Let U(T (y 3 )) be the uniform distribution on T(y 3 ). If F = U(T(y 3 )), a 
composition of translation, rotation, reflections, and scaling will take any given triangle T(y 3 ) to the basic 
triangle T b = T((0, 0), (1, 0), (ci,c 2 )) with < ci < 1/2, c 2 > 0, and (l-ci) 2 + cl < 1, preserving uniformity. 
That is, if X ~ U(T(y 3 )) is transformed in the same manner to, say X', then we have X' ~ U(Tb). In fact 
this will hold for data from any distribution F up to scale. 



2.1.1 Proportional-Edge Proximity Maps and Associated Proximity Regions 

For the expansion parameter r € [l,oo], define Npe{x 1 t) to be the proportional- edge proximity map with 
expansion parameter r as follows; see also Figure [1] (left). Using line segments from the center of mass of 
^(3^3) to the midpoints of its edges, we partition T(J^) into "vertex regions" Rv(yi), i?i/(y 2 ), and i?y(ys). 
For x £ T(y 3 ) \ 3^3, let v(x) € 3^3 be the vertex in whose region x falls, so 1 € Rv{v(x)). If x falls on 
the boundary of two vertex regions, we assign v(x) arbitrarily to one of the adjacent regions. Let e(x) 
be the edge of T(y 3 ) opposite v(x). Let £(x) be the line parallel to e(x) through x. Let d(v(x),t(x)) be 
the Euclidean distance from v(x) to £{x). For r G [1, 00), let £ r (x) be the line parallel to e(x) such that 
d(v(x),£ r (x)) = rd(v(x), £(x)) and d(£(x),£ r (x)) < d(v(x),£ r (x)). Let TpE(x,r) be the triangle similar to 
and with the same orientation as T(3^3) having v(x) as a vertex and £ r (x) as the opposite edge. Then 
the proportional-edge proximity region NpE(x,r) is defined to be TpE{x,r) PI T(y 3 ). Notice that r > 1 
implies x £ Npe{x,t). Note also that lim,.-^ Npe(x, r) = T(y 3 ) for all x € ^(3^3) \ 3^3, so we define 
Npe(x, 00) = T(y 3 ) for a ll such x. For x £ 3^3, we define NpE{x,r) = {x} for all r <S [l,oo]. See 
Cevhan and Priebe (l2003bh for more detail. 



2.1.2 Central Similarity Proximity Maps and Associated Proximity Regions 

For the expansion parameter r € (0,oo], define Ncs( x ,t) to be the central similarity proximity map with 
expansion parameter r as follows; see also Figure Q] (right). Let ej be the edge opposite vertex y^ for 
j = 1,2,3, and let "edge regions" Re(si), Re{^2), Re^) partition ^(3^3) using line segments from the 
center of mass of T(y 3 ) to the vertices. For x £ (7X3^))°, let e(x) be the edge in whose region x falls; 
x £ i?g(e(x)). If x falls on the boundary of two edge regions we assign e(x) arbitrarily. For r > 0, the 
central similarity proximity region Ncs(x, t) is defined to be the triangle Tcs(x 7 T)r\T(y 3 ) with the following 
properties: 

(i) For r £ (0,1], the triangle Tcs(x,t) has an edge e T (x) parallel to e(x) such that d(x,e T (x)) = 
t d(x,e(x)) and d(e T (x),e(x)) < d(x, e(x)) and for r > 1, d(e T (x),e(x)) < d(x, e T (x)) where d(x,e(x)) 
is the Euclidean distance from x to e(x), 

(ii) the triangle Tcs{x,t) has the same orientation as and is similar to T(y 3 ), 

(iii) the point x is at the center of mass of Tcs(x,t). 

Note that (i) implies the expansion parameter r, (ii) implies "similarity" , and (iii) implies "central" in the 
name, (parametrized) central similarity proximity map. Notice that r > implies that x £ Ncs(x,t) and, 



by construction, we have N C s(x,t) C T(y 3 ) for all x G T(y 3 ). For a; G d(T(y 3 )) and t G (0, oo], we 
define Ncs{x,t) = {x}. For all x G T(3^3)° the edges e r (x) and e(x) are coincident iff r = 1. Note also 
that lirn^oo N CS (x,t) = T(y 3 ) for all x £ (T(y 3 ) )°, so we define Ngs(x, oo) = T(y 3 ) for all such x 
Observe that the central similarity proximity maps in lCevhan and Priebel (|2003al ) and lCevhan et all (|2007f ) 
are Ncs(', T ) with r = 1 and t G (0, 1], respectively. 

Remark 2.1. Notice that l ~ -F, with the additional assumption that the non-degenerate two-dimensional 
probability density function / exists with support(/) C T(y 3 ), implies that the special case in the construc- 
tion of Npe(-, r) — X falls on the boundary of two vertex regions — occurs with probability zero; similarly, 
the special case in the construction of Ncs(^t) — X falls on the boundary of two edge regions — occurs 
with probability zero. □ 




Figure 1: Plotted in the left is the illustration of the construction of proportional-edge proximity re- 
gion, Npe(x,t = 2) (shaded region) for an x G Rv(yi) where d\ = d(v(x),£(v(x),x)) and d 2 = 
d(v(x),£2(v(x),x)) = 2d(v(x),£(v(x),x)); and in the right is the illustration of the construction of central 
similarity proximity region, Ncs( x , T = 1/2) (shaded region) for an i € Re{s-3) where h 2 = d(x,e 3 (x)) = 
i d(x, e(x)) and hi = d(x, e(x)). 



3 The Asymptotic Distribution of Relative Density for Uniform 
Data 



3.1 The One Triangle Case 

For simplicity, we consider X points iid uniform in one triangle only. The null hypothesis we consider is a 
type of complete spatial randomness (CSR); that is, 

H : X % ~ U{T (y 3 )) for i = 1, 2, . . . , n. 

If it is desired to have the sample size be a random variable, we may consider a spatial Poisson point process 
on T (y 3 ) as our null hypothesis. 

We first present a "geometry invariance" result that will simplify our subsequent analysis by allowing us 
to consider the special case of the equilateral triangle. 

Theorem 3.1. (Geometry Invariance for Uniform Data) Let y 3 = {yi,y2,Y3} C K 2 be three non- 
collinear points. For i = 1, 2, . . . , n, let X t ~ F = U(T(y 3 )). Then 

(i) for any r G [l,oo] the distribution of relative density of proportional- edge PCDs, p PE (n,r), is indepen- 
dent ofy 3 , hence the geometry ofT(y 3 ). 

(ii) for any r G (0, oo] the distribution of relative density of central similarity PCDs, p cs (n, r) , is indepen- 
dent ofy 3 , hence the geometry ofT{y 3 ). 



The proof for (i) is are provided in lCevhan et al.l (|2006f ) and the proof of (ii) for t G (0,1] is provided in 



Cevhan et al.l (|2007[ ) and the proof for r > 1 is similar. 

In fact, the geometry invariance of p PE (n, oo) for data from any continuous distribution on T(3^) follows 
trivially, since for r = oo, p PE (n,r) = 1 a.s. (i.e., it is degenerate). Likewise, the geometry invariance of 
p cs (n, oo) for data from any continuous distribution on T(y 3 ) follows trivially, since for r = oo, p cs (n, r) = 1 
a.s. (i.e., it is degenerate). 

Based on Theorem 13.11 and our uniform null hypothesis, we may assume that Ttys) is a standard equi- 
lateral triangle with vertices y 3 = {(0, 0), (1, 0), (1/2, V3/2)} henceforth. 

Remark 3.2. Notice that, we proved the geometry invariance property for the relative density of PCDs 
based on proportional-edge proximity regions where vertex regions are defined with the lines joining 3^3 to 
the center of mass Mc- If we had used the orthogonal projections from Mc to the edges, the vertex regions 
(hence ATp^(-,r)) would depend on the geometry of the triangle. That is, the orthogonal projections from 
Mc to the edges will not be mapped to the orthogonal projections in the standard equilateral triangle. Hence 
the exact and asymptotic distribution of the relative density will depend on Ci,c 2 of T b , so one needs to do 
the calculations for each possible combination of c\ , c 2 . □ 



3.2 Asymptotic Normality under the Null Hypothesis 

By detailed geometric probability calculations, the means and the asymptotic varia nces of the rela t ive de nsity 
of the proport i onal- edge and central similarity PCDs can be calculated explicitly ( Cevhan et al. ( 20061 ) and 



Cevhan et al.l (|2007t )). 

The central limit theorem for [/-statistics then establishes the asymptotic normality under the uniform 
null hypothesis. For our proportional-edge proximity map and uniform null hypothesis, the asymptotic 
null distribution of p PE (n,r) can be derived as a function of r. Let p PE {r) := E[p PE (n, r)] and v PE (r) '■= 
Cov[/ii2, his]. Notice that p PE (r) = E[/ii 2 ]/2 = P(X 2 G Npe(Xi, r)) is the probability of an arc occurring 
between any pair of vertices, hence is called arc probability also. Similarly, the asymptotic null distribution of 
p cs (n, t) as a function of r can be derived. Let p cs (r) := E[p cs (n,r)], then p cs (t) = E[/ii 2 ]/2 = PyX.2 G 
Ncs(Xi,t)) and let v cs {t) := Cav\h\%, /113]. These results are summarized in the following theorems. 

Theorem 3.3. For r G [l,oo), 

Vn(p PE (n,r) ~ p PE (r)) c 



AA(0,1) (7) 



where 



^V 2 for re [1,3/2), 



216 ' 

{ -Ir 2 +4-8r- 1 + |r- 2 for rG [3/2,2), 



-r 2 for r G [2, 00), 



and 

!/ ps (r) = i/!(r) I(r £ [1,4/3)) + . 2 (r) I(r £ [4/3, 3/2)) + i/ 3 (r) I(r e [3/2, 2)) + i/ 4 (r) I(r e [2, 00]) (9) 



with 



3007 r 10 - 13824 r 9 + 898 r 8 + 77760 r 7 - 117953 r 6 + 48888 r° - 24246 r 4 + 60480 r 3 - 38880 r 2 + 3888 

58320 r 4 ' 
5467 r 10 - 37800 r 9 + 61912 r 8 + 46588 r 6 - 191520 r 5 + 13608 r 4 + 241920 r 3 - 155520 r 2 + 15552 



K ' 233280 r 4 

M r ) = ~[7r 12 - 72 r 11 +312r 10 - 5332 r 8 + 15072 r + 13704 r 6 - 139264 r 5 + 273600 r 4 - 242176 r 3 

+ 103232 r 2 - 27648 r + 8640]/[960 r 6 ], 

15r 4 -llr 2 -48r + 25 
Mr) = 157^ ' 

For r = 00, p PE {n,r) is degenerate. 



See lCevhan et al.l (|2006h for the proof. 



Theorem 3.4. For t G (0, oo), 



where 



(0,1)7 



Pcs( T ) - 



r 2 /6 /or re (0,1], 

2(l+2rK2+T) /° r TG(l.OO), 



(10) 



(11) 



r 4 (6 r 5 -3 r 4 -25 t 3 +t 2 +49 r+14) 
45(r+l)(2T + l)(r+2) 



/or re (0,1], 

^cs ( T ) 

I 168t 7 +886t 6 + 11; 

5(2t + 1) 4 (t+2) 

For t — 0, p cs (n, r) is degenerate. 

See Cevhan et al. < 20071 ) for the derivation for r £ (0, 1] and Appendix 1 for r > 1 



(12) 




Figure 2: Asymptotic null means (i.e., arc probabilities) (i PE (r) (left) and fi os (r) (right) as a function of the 
expansion parameters from Theorems 13.31 and 13.41 respectively. The vertical lines indicate the endpoints of 
the intervals in the piecewise definition of the functions. Notice that the vertical axes arc differently scaled. 

Consider the forms of the mean functions, which arc depicted in Figure [2j Note that p PE (r) is mono- 
tonically increasing in r, since Np E (x,r) increases with r for all x £ Ry(yj) \ &s{Np E (-,r), Mc), where 
Ms{N PE {-,r),Mc) := {x £ T(y 3 ) : N PE (x,r) = T(y 3 )}. In addition, p PE (r) -> 1 as r -> oo (at rate 
0(r -2 )), since the digraph becomes complete asymptotically, which explains why p PE (n,r) becomes de- 
generate, i.e., v PE (r = oo) = 0. p PE (r) is continuous, with the value at r = 1, fj, PE (l) = 37/216 ps 
.1713. Note also that p cs {r) is monotonically increasing in r, since Ncs{x,t) increases with r for all 
x £ R E (e j )\^ s (Ncs(;r),M c ), where M S {N CS {-, r), M c ) := {a; £ TO) : A cs (x,r) = T(y 3 )}. Note 
also that p cs {r) is continuous in r with p cs (t = 1) = 1/6 and lim r ^o P C s ( r ) = 0- In addition, /i cg (r) — > 1 
as t — ^ oo (at rate O (r -1 )), so p cs {n,r) becomes degenerate as r 
and A t cs( T ) are plotted together in Figure 0] (left). Observe that p JI 

T £ (0, OO). 

The asymptotic variance functions are depicted in Figure [3l Note that v PE (r) is also continuous in 
r with Iim^oo v PB (r) = and f PB (l) = 34/58320 ~ .000583 and observe that sup r>1 i/ p]! (r) « .1305 
which is attained at r « 2.045. Note also that v cs {t) is continuous in r with Iim T _> 00 f cg (r) = and 
v(t = 1) = 7/135 and lim r _»o v C s ( T ) = — there are no arcs when r = a.s. — which explains why 
p n (r = 0) is degenerate. Moreover, sup T>0 ^ cs (t) ss .1767 which is attained at r w 4.0051. The asymptotic 
variances v PE (r) and i / cs (t) are plotted together in Figure 0] (right). Observe that v cs (t) > v PE {r) for all 
r £ [l,oo) and r € (0, oo). 

approx 



> oo. The asymptotic means p PE (r) 
, (r) > p cs (r) for all r G [l,oo) and 



To illustrate the limiting distribution, r = 2 yields 
V^(Pp E K 2) - m pe (2)) _ 
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Figure 3: Asymptotic null variances v PE (r) (left) and v cs (r) (right) as a function of the expansion param- 
eters from Theorems I3.3I and I3.4I respectively. The vertical lines indicate the endpoints of the intervals in 
the piecewise definition of the functions. Notice that the vertical axes are differently scaled. 




Figure 4: Asymptotic null means (i.e., arc probabilities) (left) and variances (right) as a function of the 
expansion parameters for relative density of proportional-cdgc PCDs (solid line) and central similarity PCDs 
(dashed line). The vertical lines indicate the endpoints of the intervals in the piecewise definition of the 
functions. Notice that the vertical axes are differently scaled. 



c 

where — > stands for convergence in law or distribution. 

Similarly, r = 1 yields p cs (n, 1) ap K ox jV(I 5 13 5„ ) or equivalently, 



Vn(/> os (n,l)-A*o*(l)) /135n / , 1 



c 



^ V 7 V A»<».l)-ij-^<M>. 

The finite sample variance and skewness of p PE (n, r) and p cs (n, r) may be derived analytically in much 
the same way as was asymptotic variances. In particular, the variance of h\2 for proportional-edge PCD is 



■ 'rjxy, 

a£»I(r G [4/3,3/2))+^(r)I(r G [3/2,2)) + W j£(r)I(r G [2,oo)) 



Upji (r)=VarM=^(r)I(r€ [M/3))+ 

.l,2/_\ Tt„ r- \A In o /n\\ I , ,1.3/„\ T/„ r- TO /O 1\\ I , .1,4 



where 



u pe ( r ) 



1.2 / \ 



!.3 /• \ 



-(1369 r 8 + 4107 r 7 + 902 r 6 - 78084 r 5 + 161784 r 4 - 182736 r 3 - 23328 r 2 + 155520 r - 55296) 

11664 (r + 2)(r + l)r 2 

1369 r 7 + 4107 r 6 + 9650 r 5 - 98496 r 4 + 132624 r 3 - 79056 r 2 - 57888 r + 72576 

11664 (r + 2)(r + l)r ' 
r 10 + 3 r 9 - 62 r 8 + 968 r 6 - 1704 r 5 - 1824 r 4 + 5424 r 3 - 1168 r 2 - 3856 r + 2208 

16 (r + 2)(r + l)r 4 ' 

3r 3 + 3r 2 + 3r-13 
r 4 (r + l) 



Figure 5: Varf/iia] = lj pe (t) as a function of r for r G [1, 10] (left) and Var[/ii2] = u; cs (r) as a function of 
r G (0,10] (right). 



In Figure [5] (left) is the graph of w PE (r) for r G [1, 10]. Note that ui(r = 1) = 2627/11664 : 
linv-^oo w PE (r) = (at rate O (r~ 2 )), sup re r li00 j ui PE (r) w .6796 which is attained at r ps 1.66. 



.2252 and 




Figure 6: Var[ft, 12 ] terms for relative density of proportional-edge PCDs (solid line) and central similarity 
PCDs (dashed line) as a function of the expansion parameters. The vertical lines indicate the endpoints of 
the intervals in the piccewise definition of the functions. 



Moreover, the variance of h\i for central similarity PCDs is 



-(t 3 +7i 



-5r-15 )t a 



9t+3 

-(2 r 4 + ll t 3 +9t 2 -33t-81)t 2 

9(r+3)(2r+5) 
2 (22t 4 + 151t 3 +244t 2 + 12t-15)t 

(t+2) 2 (2t+1) 2 (t+3)(2t+5) 



for re [0,1/2), 
for re [1/2,1), 
for r e [1, oo). 



In Figured] (right) is the graph of w cs (r) for r e [1, 10]. Note that uj cs (t) is a continuous function 



of r with linv^o w ( r ) = and lu(t = 1) = 23/63 w .3651. Furthermore, lim T _ 



. (r) =0 (at rate 



O (r 2 )), sup r€ ( w cs (r) w .60 which is attained at r w 2.94. The variances Var[/ii2], w PB (r) and 
oj cs (r) are plotted together in Figurc[6l Observe that uj cs (t) > to PE {t) for 1 < t < 1.165) and £ > 2.09; and 
u PE (t) > u) as (t) for 1.165 < t < 2.09 

In fact, the exact distribution of p PE (n, r) is, in principle, available by successively conditioning on the 
values of Xi. Alas, while the joint distribution of h\2, his is available, the joint distribution of {/iij}i< ? <j< ?l , 
and hence the calculation for the exact distribution of p PE (n,r), is extraordinarily tedious and lengthy for 
even small values of n. The same holds for the the exact distribution of p as (n,T). 

Figure [7] indicates that, for r = 2, the normal approximation for the relative density of proportional-edge 
PCD is accurate even for small n (although kurtosis may be indicated for n = 10). Figure [8] demonstrates, 
however, that severe skewness obtains for small values of n and extreme values of r. 



Figure 7: Depicted are the distributions of p PE (n,2) ap K ox AT(|, jff^) for n = 10,20, 100 (left to right). 
Histograms are based on 1000 Monte Carlo replicates. Solid curves represent the approximating normal 
densities given in Theorem 13.31 Note that the vertical axes arc differently scaled. 




Figure 8: Depicted are the histograms of relative density for 10000 Monte Carlo replicates of p PB (10, 1) 
(left) and p PE (10, 5) (right) indicating severe small sample skewness for extreme values of r. 



Figure IH1 indicates that, for r = 1, the normal approximation for the relative density of central similarity 
PCD is accurate even for small n (although kurtosis and skewness may be indicated for n = 10, 20). Figure 
[TU1 demonstrates, however, that the smaller the value of r, the more severe the skewness of the probability 
density. 



3.3 The Multiple Triangle Case 

In this section, we present the asymptotic distribution of the relative density in multiple triangles. Suppose 
y rn = {yi,y2, • ■ • ,y m } C I 2 be a set of m points in general positio n with m > 3 and n o more than three 



points are cocircular. As a result of the Delaunay triangulation of y m (jOkabe et all (|2000fi ). there are J m > 1 
Delaunay triangles each of which is denoted as Tj . The Delaunay triangles partition the convex hull of y m . 
We wish to investigate 

H :X t ~U{C H {y m )) fori = l,2,...,n (13) 

against segregation and association alternatives (see Section 2]). Figure ITT1 (middle) presents a realization of 
1000 observations independent and identically distributed as U(CH(y m )) for m = 10 and J m = 13. 

For J m > 1 (i.e., m > 3), as in Section [2l let p PE (n,m,r) = \A\ /{n (n — 1)) be the relative density for 
the proportional-edge PCD in the multiple triangle case. Let p cs (n,m,r) and P cs (j) be defined similarly 
for the central similarity PCD. Let n, be the number of X points in Tj for i = 1,2, ...,J m . Letting 




Figure 9: Depicted are p cs (n, 1) ap K ox J\f [1 } for n = 10, 20, 100 (left to right). Histograms are based 

on 1000 Monte Carlo replicates. Solid curves represent the approximating normal densities given in Theorem 
13.41 Note that the vertical axes are differently scaled. 




Figure 10: Depicted are the histograms for 10000 Monte Carlo replicates of p cs (10, 1) (left), p cs (10, 2.5) 
(middle), and p cs (10, 10) (right) indicating severe small sample skewness for extreme values of r (i.e., r = 1 
or t = 10). 




Figure 11: Realization of segregation (left), H : CSR (middle), and association (right) for \y m \ = 10. 



Wi = A(Ti) / A(C h (y m )) with A(-) being the area function and W = {wi, w 2 , ■ ■ ■ , wj m }, we obtain the 
following as a corollary to Theorems 13.31 and 13.41 



Corollary 3.5. For r G [l,oo], the asymptotic distribution for p PE (n,m,r) conditional on W is given by 

Vn(p PB (n,m,r) -Ji PE (m,r)) — > Af (0,Av PE (m,r)) , (14) 
asn-yoo, where Jl PE (ni,r) = p PE (r) [J2i=i w i) an( ^ 



u PE (m,r) 



Jm. \ J m 



2 



E™ 3 - E w 

i=i 



with H PE (r) and v PE {r) being as in Equations §E§ and respectively. The asymptotic distribution of 
pcs( n i m , T ) with t G (0, 00] is similar. 

Proof: The expectation of p PE (n, m, r) is 

E [p PE (n,m,r)] = - 1 E E E NM] = E [h 12 (r)} /2 = P(X 2 G JW*i,r)) = £ PE (r). 

^ ' i<3 

By definition of N P e(-, r), if Xi and X 2 are in different triangles, then P(X 2 G N P e{Xi, r)) = 0. So by the 
law of total probability 



Jl PE (r) := P(X 2 G NpEtX^r)) = E e ^(^1, r) | {Jf x> X 2 } C Ti)P({X!,X 2 } c T t ) 

?:=i 

= E^( r ) P ({ Xl ' X2 > C T ^ ( sincc G N PE (X x ,r) I {X l5 X 2 } c Ti) - n PE (r)) 



i=l 



Cpe 0) E ^ 



\i=l / 

where p PE {r) is given by Equation (|5J|. 

Likewise, we get ju cs (r) = /i cs (r) ^X)f=i where p cs (r) is given by Equation 
Furthermore, the asymptotic variance is 

v PE (m,r) = E[h 12 h 13 ] - E[h 12 ]~E[h 13 } 

= P({X 2 ,X 3 } c iVp E (^,r))+2P(X 2 G N PE (X 1 ,r),X 3 G r[ B (Xi,r)) 
+P({X 2 ,X 3 } C I\ £ (Xi,r)) -4(M P£ (m,r)) 2 . 

Let P 2 » := P({X 2 ,X 3 } C JW*i,»0), P^(r) := P({X 2 ,X 3 } C r^^i.r)), and P p ^(r) := 
P(X 2 G NpsiXur^Xa G T^p^r)). Then for J m > 1, we have 



P({X 2 ,X 3 } C N PE {X 1 ,r)) = J2P({X 2 ,X 3 } c JVp B (A:i,r)|{Xi,Xa,X3} C 7)) P({X U X 2 , X 3 } c 7) 

3=1 

= E p p^w( A ( T i)M(^«)) 3=p -w IE/ 



tr 3 

3=1 Vj'=l 



Similarly, P(X 2 g N PE (X 1 ,r),X 3 G J^' (X^r)) = P£(r) (££i«$) and P({* 2 ,X 3 } C rf* (*!,»•)) 
^(r) fE-=i^f)> hence, 



? PB (m,r) = (P^(r)+2Pi f +P^W) (|>°) "^(m.r) 2 = v PE (r) (|>f ) +4 MpB (rf ^E^-(E^) 

Likewise, we get £ cs (r) = zy cs (r) (^"i ™t) + 4 Mcs( T ) 2 fe^i ™? - (e/=i w i) J ■ 

So, conditional on W, iiv PE (r) > 0, then y/n (p PE (n,m,r) — (J> PE (r)) — > N (0, v PE (r)). A similar result 
holds for the relative density of the central similarity PCD. ■ 

By an appropriate application of the Jensen's inequality, we see that Ei^Ti ivf > ^Eiii wf^j . So the 

covariance above is zero iff v PE (r) = and J2i=i w i = (E/=i w i^j > so asymptotic normality may hold even 
though v PE (r) = in the multiple triangle case. That is, p PE (n, m, r) has the asymptotic normality for 
r = 00 also provided that E^*i io| > ^E£?l w i^j ■ The same holds for r = 00 in the central similarity case. 

4 Alternative Patterns: Segregation and Association 

In a two class setting, the phenomenon known as segregation occurs when members of one class have a 
tendency to repel members of the other class. For instance, it may be the case that one type of plant does 
not grow well in the vicinity of another type of plant, and vice versa. This implies, in our notation, that 
Xi arc unlikely to be located near elements of y m . Alternatively, association occurs when members of one 
class have a tendency to attract members of the other class, as in symbio t ic spe cies, so that Xj will tend to 
cluster around the elements of y m , for example. See, for instance. iDixon (|l994h and lCoomes et all (| 19991) . 



These alternatives can be parametrized as follows. In the one triangle case, without loss of generality 
let y 3 = {(0,0),(l,0),( Cl ,c 2 )} and T b = T{y 3 ) with yi = (0,0), y 2 = (1,0), and y 3 = (ci,c 2 ). For the basic 
triangle T b , let Q := {x G T b : d(x,y 3 ) < 6} for 8 G (0, (c? + c|)/2] and S{F) be the support of F. Then 
consider 

JF S ■= {F : S(F) C T b and P F (X G Qe) < Pu(X G Qe)} 

and 

ie A := {F : S(F) C T b and P F (X G Qe) > P V {X G Qe)} 

where Pp and Pjj are probabilities with respect to distribution function F and the uniform distribution 
on T b , respectively. So if Xi ^ F G Ms, the pattern between class X and y points is segregation, but if 
Xi % ri F G M'a , the pattern between class X and y points is association. For example the distribution family 

&s : = {F '■ S(F) C T b and the associated pdf / increases as d(x,y 3 ) increases} 

is a subset of Ms ancl yields samples from the segregation alternatives. Likewise, the distribution family 

'■= {F : S(F) C T b and the associated pdf / increases as d(x,y 3 ) decreases} 

is a subset of M'a and yields samples from the association alternatives. 

In the basic triangle, T b , we define the alternatives Hf and with e G (0, -\/3/3), for segregation and 
association alternatives, respectively. Under , 4e 2 /3 x 100 % of the area of T b is chopped off around 
each vertex so that the X points are restricted to lie in the remaining region. That is, for yj G 3^3, let ej 
denote the edge of T b opposite vertex \/j for j = 1, 2, 3, and for x G T b , let £j{x) denote the line parallel to ej 

2 £ 2 ^2 £ 

through x. Then define Tj(s) — {x G T b : d(yj,£j(x)) < Sj} where E\ — — , e 2 = — , 

3V C 2 + (1 - c i) 2 3Vc? + c| 

and e 3 = Let % := \J 3 j=1 T 3 {e). Then under iff, we have X. t ~ U (T b \ %)■ Similarly, under H^, we 

3 J 

have Xi ~ U ( 7y3/ 3 _ e J . Thus the segregation model excludes the possibility of any Xi occurring around a 



Figure 12: An example for the segregation alternative with a particular expansion parameter e (shaded 
region), and its complement is for the association alternative with expansion parameter v3/3 — £ (unshaded 
region) on the standard equilateral triangle. 



Yj, and the association model requires that all Xi occur around y 3 's. The v3/3 — e is used in the definition 
of the association alternative so that e = yields H a under both classes of alternatives. Thus, we have the 
below parametrization of the distribution families under the alternatives. 



% s :={F:F= U(T b \ %)} and % A := {F : F = U{T^ /3 _ e )}. (15) 

Clearly % s C .Jtf> s and ^4, 3 _ E C ,W A , but % s £ & s and ^ /3 _ £ g ^A- 

These alternatives J?^ a nd ff^ with £ 6 (0, y3/3), can be transformed into the equilateral triangle as 
in lCevhan et~aTI (|2006l ) and lCevhan et al.l (|2007t ). 

For the standard equilateral triangle, in Tj(e) = {x G T e : d(y,£j(x)) < £<,}, we have £i = £2 = £3 = £• 
Thus Hi implies X, ~ U (T e \ %) and be the model under which X, ~ W (V^^) . Sec Figure [TJ for 
a depiction of the above segregation and the association alternatives in T e . 

Remark 4.1. The geometry invariance result of Theorem 1 3 . 1 1 also holds under the alternatives and H A for 
both PCD families. In particular, the segregation alternative with e G (0, \/3/4) in the standard equilateral 
triangle corresponds to the case that in an arbitrary triangle, kx 100% of the area is carved away as forbidden 
from the vertices using line segments parallel to the opposite edge where k = 4e 2 (which implies kg (0, 3/4)). 
But the segregation alternative with e G (\/3/4, V3/3) in the standard equilateral triangle corresponds to 
the case that in an arbitrary triangle, k x 100% of the area is carved away as forbidden from each vertex using 
line segments parallel to the opposite edge where re = 1 — 4 (l — V3e) (which implies n G (3/4, 1)). This 
argument is for the segregation alternative; a similar construction is available for the association alternative. 
□ 

Remark 4.2. The Alternatives in the Multiple Triangle Case: In the multiple triangle case, the 
segregation and association alternatives, Hi and Hf with e G (0, v3/3) , are defined as in the one-triangle 
case, in the sense that, when each triangle (together with the data in it) is transformed to the standard 
equilateral triangle as in Theorem 13.11 we obtain the same alternative pattern described above. 

Thus in the case of J m > 1, we have a (conditional) test of H : Xi l ~ U{CH{ym)) which once again rejects 
against segregation for large values of p n (r, J) and rejects against association for small values of p n (r,J). 
The segregation (with k = 1/16, i.e., e = \/3/8) , null, and association (with n = 1/4, i.e., £ = \/3/12) 
realizations (from left to right) are depicted in Figure [TT] with n = 1000. □ 



4.1 Asymptotic Normality under the Alternatives 

Asymptotic normality of relative density of the PCDs under both alternative hypotheses of segregation and 
association can be established by the same method as under the null hypothesis. Let Ef [•] (E^[-]) be the 



expectation with respect to the uniform distribution under the segregation ( association ) alternatives with 
£ G (0,\/3/3). 

Theorem 4.3. 

(i) Let /j,p E (r,e) be the mean E^ [/112] and u p (r,e) be the covariance, Covf [/ii2> his] for r G [l,oo] and 

e G [0,v / 3/3) under . Then as n — > 00, \fn{o PB (n, r) — /u^ (r, e)) — — > A/"(0, 1^ (r, e)) /or £/ie 
values of (r,e) for which u p (r,e) > 0. A similar result holds under association. 

(ii) Let pf, (r, e) fee i/ie mean Ef [ft.12] and ^ (r,e) 6e i/ie covariance, Cov g [/112, /113] /or r € (0,oo] and 

e G [0,\/3/3) under . Then as n — > oo, y / n(p os (n,r) — /i^ E (r, e)) — ► AT(0, (t, e)) for the 
values of (r, e) /or which v s (r, e) > 0. ^4 similar result holds under association. 



A sketch of the proof of part (i) is provided in (jCevhan et al.l (|2006h ). and of part (ii) for r G (0, 1] is 
provided in (jCevhan et al. I (|2007t )). The proof of part ( ii) for t G (1, 00) is similar. 

The explicit forms of p,f E (r,e) and (i PE (r,e) are given, defined piecewise, in (jCevhan et al.l (j2004bh V 
Note that under i/^, 

> for ( r > £ ) e [l, V3/(2e)) x (0,73/4] |J [l,V3/e - 2) x (V3/4, V3/3) , 

and under H^, 

i/£,(r,e)>0far (r,e) e (l,oo) x (o, y/3/a) \J{1} x (0,^3/12^ 



The explicit forms of /i^ g (r, e) and A 4 ^ (r, e) are given, defined piecewise, in (jCevhan et al. ( 2004a )). Note 
that under 



i&Cr.e) > for (t,s) G (0,oo) x (0, 3 v/3/lo] |J f 2 ^ , 00 



x (3V3/10, V3/3), 



and under H^, 



v£ s (t,e) > for (r,e) G (0,oo] x (0, V3/3). 



Notice that under association alternatives any r G [1, 00) yields asymptotic normality for relative density 
of proportional-edge PCD for all e G (0,^/3/3), while under segregation alternatives, only r = 1 yields 
this universal asymptotic normality. Furthermore, under association alternatives any r G (0, 00) yields 
asymptotic normality for relative density of central similarity PCD for all e G (0, \/3/3). The same holds 
under segregation alternatives. 

The asymptotic normality also holds under the alternatives in the multiple triangle case. For example, 
for the relative density of proportional-edge PCDs, the asymptotic mean and variance are as in Corollary 
OSS with (J. PE {r) (y PB (r)) being replaced by (J.f E {r,e) {v pE (r,e)) for segregation and by (J. pE {r,e) {vf E (r,s)) 
for association. 

and 



4.2 The Test Statistics and Analysis 

The relative density of the PCD is a test statistic for the segregation/association alternative; rejecting for 
extreme values of p PE (n, r) is appropriate since under segregation we expect p PE (n, r) to be large, while 
under association we expect p PE (n, r) to be small. 

In the one triangle case, using the standardized test statistic 

RpEir) = i (16) 

the asymptotic critical value for the one-sided level a test against segregation is given by 

z a = <Z>- 1 (l-a) (17) 



where $(■) is the standard normal distribution function. Against segregation, the test rejects for Rpe{t) > z a 
and against association, the test rejects for Rpe{t) < Zx- a - The same holds for the standardized test statistic 

• il u- 1 4. • l 5 f \ \/n[Ppis( n : r )-'iZpis( r )) 

m the multiple triangle case, RpE(r) = — , s — '-■ 

A similar construction is available for p cs (n, r) with 

Rcs(t) = (18) 

in the one triangle case, and with Rcs( T ) = ^^^t'^, ^cs( T )) m ^ e multiple triangle case. 

4.3 Consistency 
Theorem 4.4. 

(i) In the one triangle case, the test against which rejects for Rps{r) > zi- a and the test against 
which rejects for RpE{r) < z a are consistent for r £ [1, oo) and e <G (0, V3/3) . The same holds in the 
multiple triangle case with RpE(r). 

(ii) In the one triangle case, the test against which rejects for Rcs{ T ) > Z\—a an d the test against 

which rejects for Rcs{ T ) < z a are consistent for r € (0, oo) and e € (0, V3/3) . The same holds in the 
multiple triangle case with Rcs(j~). 



For the one triangle c ase, the proof of (i) i s provided in ( Cevhan et al. ( 20061 )) and the proof of (ii) with 
r e (0, 1] is provided in ( Cevhan et al. ( 2007t )). The proofs for the multiple triangle cases and for (ii) with 
r > 1 are similar. 



5 Empirical Size Analysis 

5.1 Empirical Size Analysis for Proportional-Edge PCDs under CSR 

In one triangle case, for the null pattern of CSR, we generate n X points iid U(T e ) where T e is the standard 
equilateral triangle. We calculate the relative density of proportional-edge PCDs for r = 1, 11/10, 6/5, 4/3, y/2, 
3/2, 2, 3, 5, 10 at each Monte Carlo replicate. We repeat the Monte Carlo procedure N mc = 10000 times for 
each of n = 10, 50, 100. Using the critical values based on the normal approximation for the relative density, 
we calculate the empirical size estimates for both right-sided (i.e., for segregation) and left-sided (i.e., for 

association) tests as a function of the expansion parameter r. Let RpE{r)(r,j) := V< "^ FE ^'''' , '| ^p E ( r )) ^ c 

the standardized relative density for Monte Carlo replicate j with sample size n for j = 1,2,..., N mc . For 
each r value, the level a asymptotic critical value is [i PE (r) + Z(i_ a ) ■ V ' V PE ('') l n - We estimate the empirical 
size against the segregation alternative as j^—^2f = ^i{RpE( r ){ r ij) > z i~a), an d against the association 
alternative as j^— Ylj=x I (RpE{r)(r,j) < z a ). The empirical sizes significantly smaller (larger) than .05 are 
deemed conservative (liberal). The asymptotic normal approximation to proportions is used in determin- 
ing the significance of the deviations of the empirical sizes from .05. For these proportion tests, we also use 
a = .05 as the significance level. With N mc = 10000, empirical sizes less than .0464 are deemed conservative, 
greater than .0536 are deemed liberal at a = .05 level. The empirical sizes for the proportional-edge PCDs 
together with upper and lower limits of libcralness and conservativeness are plotted in Figure 1131 Observe 
that as n increases, the empirical size gets closer to the nominal level of 0.05 (i.e., the normal approximation 
gets better). For the right-sided tests (i.e., relative to segregation) the size is close to the nominal level for 
r G (2,3), for smaller r values (i.e., r < 2) the test seems to be liberal with liberalness increasing as r de- 
creases; and for larger r values (i.e., r > 3) the test seems to be conservative with conservativeness increasing 
as r increases. For the left-sided tests (i.e., relative to association) the size is close to the nominal level for 
r E (1.5, 3), for other r values the test seems to be liberal (more liberal for smaller r values). This is due to 
the fact that very large and small values of r require much larger sample sizes for the normal approximation 
to hold. 
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Figure 13: Empirical size for Rpe(t) in the one triangle case: The empirical size estimates of the 
relative density of proportional-edge PCDs in the one triangle case based on 10000 Monte Carlo replicates 
for the left-sided alternative, i.e., relative to segregation (top) and the right-sided alternative, i.e., relative 
to association (bottom) with n = 10 (left column), n = 50 (middle column), and n = 100 (right column) 
under the CSR pattern. The horizontal lines are located at .0464 (upper threshold for conservativeness), 
.050 (nominal level), and .0536 (lower threshold for liberalness) . 

In the multiple triangle case, for the null pattern of CSR, we generate n X points iid U(Ch (3^io)) where 
3^io is the set of the 10 class y points given in Figure [TTJ With N mc = 1000, empirical sizes less than .039 
are deemed conservative, greater than .061 are deemed liberal at a = .05 level. The empirical sizes for the 
proportional-edge PCDs together with upper and lower limits of liberalness and conservativeness are plotted 
in Figure [TU Observe that in the multiple triangle case (which is more realistic than the one triangle case) 
the empirical sizes are much closer to the nominal level compared to the one triangle case. For the right-sided 
alternative (i.e., against segregation), the size is about the nominal level for r £ (1.5,3), and for the left- 
sided alternative (i.e., against association), the size is about the nominal level for r £ (1.1,2). Furthermore, 
although the empirical sizes for both right- and left-sided alternatives are about the desired level for r values 
between 1.5 and 2, it seems that they are not very far from the nominal level for r £ (1.5,10). The test 
seems to be liberal for the right-sided alternative and conservative for the left-sided alternative, if not at the 
desired level. 

5.2 Empirical Size Analysis for Central Similarity PCDs under CSR 

In one and multiple triangle cases, data generation is as in Section 15.11 and we compute the relative den- 
sity of central similarity PCDs for r = 0.2, 0.4, 0.6. 3.0, 3.5, 4.0, 20.0 at each Monte Carlo repli- 
cate. Let RnsMiT.i) := — ( pcs ^"' T,: '- ) ^csjlj) ^ ^ nc standardized relative density for Monte Carlo 

replicate j with sample size n for j = 1, 2, . . . , N mc . For each r value, the level a asymptotic critical 
value is ^ cs (r) + zt\_ a \ ■ yjv cs (r)/n. We estimate the empirical size against the segregation alternative as 

JT~ Y^j=\ I (Rcsi T )( T : j) > z i-a) and against the association alternative as j^— Y^f=i I (^cs( T )( T i j) < z a) 
In one triangle case, the empirical sizes for the central similarity PCDs together with upper and lower limits 
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Figure 14: Empirical size for Rpe{t) in the multiple triangle case: The empirical size estimates 
of the relative density of proportional-edge PCDs in the multiple triangle case based on 1000 Monte Carlo 
replicates for the right-sided alternative (i.e., relative to segregation) (top) and the left-sided alternative (i.e., 
relative to association) (bottom) with n = 500 (left column) and n = 1000 (right column) under the CSR 
pattern. The horizontal lines are located at .039 (upper threshold for conservativeness), .050 (nominal level), 
and .061 (lower threshold for liberalness) . 



of liberalness and conservativeness are plotted in Figure [15] Observe that as n increases, the empirical size 
gets closer to the nominal level of 0.05 (i.e., the normal approximation gets better). For the right-sided 
tests, the size is close to the nominal level for r <G (5, 14) and closest to 0.05 for r w 5 or r <E (7, 9) for all 
sample sizes; for smaller r values (i.e., r < 4.5) the test seems to be liberal with liberalness increasing as r 
decreases; and for r > 15 the test is extremely conservative with size being virtually for n = 10 and the 
test is slightly conservative for n = 50 and 100. For larger n (i.e., n > 50), the test has the desired size 
for r > 4. Considering all sample sizes, we recommend r £ (5, 10) for testing against segregation. For the 
left-sided tests with n — 10 the size is close to the nominal level for r £ (2,4). For n = 50 the test has the 
desired size for r £ (2, 10) and for n = 100 the test has the desired size for r £ (2, 15). With all sample sizes, 
the test seems to be conservative (slightly liberal) for smaller (larger) r values. Considering all sample sizes, 
we recommend r £ (2.5, 5) for testing against association. The range of appropriate r values gets wider 
with the increasing sample size and very large and small values of r require much larger sample sizes for the 
normal approximation to hold. 

In the multiple triangle case, the empirical sizes for the central similarity PCDs are plotted in Figure [TBI 
Observe that in the multiple triangle case the empirical sizes are much closer to the nominal level compared 
to the one triangle case. Furthermore, for the right-sided alternative with n — 500, the test has the desired 
level for r £ (.8,4), t rs 5, and r £ (12,20) and with n = 1000 for r > 2. Considering all sample sizes, we 
recommend r £ (2.5, 8) for testing against segregation. For the left-sided alternative, with n = 500, r > .5 
(except r = 7 or 11) seems to yield the appropriate level and with n = 1000, r > .5 seems to yield the 
appropriate level. Considering all sample sizes, we recommend r £ (0.5, 20) for testing against association. 

Remark 5.1. Empirical Size Comparison for the Two PCD Families: In the one triangle case, the 
size estimates for the central similarity PCD is close to the nominal level of 0.05 against the segregation 
alternative for more of the expansion parameter values considered. On the other hand, the size estimates 
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Figure 15: Empirical size for Rcs( T ) m the one triangle case: The empirical size estimates of the 
relative density of central similarity PCDs in the one triangle case based on 10000 Monte Carlo replicates 
for the left-sided alternative (i.e., relative to segregation) (top) and the right-sided alternative (i.e., relative 
to association) (bottom) with n = 10 (left column), n = 50 (middle column), and n = 100 (right column) 
under the CSR pattern. The horizontal lines are as in Figure IT3l 



against association are close to the nominal level for both PCD families, but it seems that the size estimates 
for central similarity PCD is closer to the nominal level. In the multiple triangle case, the size performance 
of the two PCD families is similar and the size estimates are close to the nominal level for both of the 
alternatives. □ 



6 Empirical Power Analysis under the Alternatives 

To compare the power performance of the test statistics under the alternatives, we generate n X points 
uniformly in the corresponding support sets as described in Section |4] and provide the empirical power 
estimates of the tests under the segregation and association alternatives. 

6.1 Empirical Power Analysis for Proportional-Edge PCDs under the Segrega- 
tion Alternative 

In the one triangle case, at each Monte Carlo replicate under segregation , we generate Xi ~ U (T e \ T e ), 
for i = 1, 2, . . . , n for n = 10, 50, 100. At each Monte Carlo replicate, we compute the relative density of the 
proportional-edge PCDs. We consider r € {1, 11/10, 6/5,4/3, \/2> 3/2, 2,3, 5, 10} for the proportional-edge 
PCD. We repeat the above simulation procedure N mc = 10000 times. We consider e G {vo/8, \/3/4, 2 V3/7} 
(which correspond to 18.75 %, 75 %, and 4500/49 « 91.84 % of the triangle (around the vertices) being 
unoccupied by the X points, respectively) for the segregation alternatives. 

Under segregation alternatives with e > 0, the distribution of p PE (n, r) is degenerate for large values 
of r. For a given e g (0,v / 3/4), the corresponding digraph is complete almost surely, for r > hence 





Figure 16: Empirical size for Rcsij) in the multiple triangle case: The empirical size estimates 
of the relative density of central similarity PCDs in the multiple triangle case based on 1000 Monte Carlo 
replicates for the left-sided alternative (i.e., relative to segregation) (top) and the right-sided alternative (i.e., 
relative to association) (bottom) with n = 500 (left column) and n = 1000 (right column) under the CSR 
pattern. The horizontal lines are as in Figure [14] 



p PE (n, r) = 1 a.s. For e £ (\/3/4, \/3/3), the corresponding digraph is complete almost surely, for r > ^ £ 2s . 
In particular, for e = a/3/8, p PE (n,r) is degenerate for r > 4, for e = \/3/4, p PE {n,r) is degenerate for 
r > 2, and for e = 2 \/3/7, p PE (n, r) is degenerate for r > 3/2, 

In the one triangle case, we plot the kernel density estimates for the null case and the segregation 
alternative with e = v3/8 and e = v / 3/4 with n = 10 and N mc = 10000 in Figure IT7l Observe that 
under both H a and alternatives, kernel density estimates are almost symmetric for r = 3/2. Moreover, there 
is much more separation between the kernel density estimates of the null and alternatives for e = \/3/4 
compared to e = v3/8, implying more power for larger e values. In Figure [T8l we present a Monte Carlo 
investigation against the segregation alternative f° r r = H/10, and n = 10, N mc = 10000 (left), 

n = 100, N mc = 1000 (right). With n = 10, the null and alternative kernel density functions for /?io(ll/10) 
are very similar, implying small power. With n = 100, there is more separation between null and alternative 
kernel density functions implying higher power. Notice also that the probability density functions are more 
skewed for n = 10, while approximate normality holds for n = 100. 

For a given alternative and sample size, we analyze the empirical power of the test based on p PB (n, r) — 
using the asymptotic critical value — as a function of the expansion parameter r. We estimate the empirical 
power as j^— Y^j^i I (RpE(r){r, j) > Z\- a ). In Figure [19j we present Monte Carlo power estimates for 
relative density of proportional-edge PCDs in the one triangle case against £T^ g , H^^ 4 , and H^^. 7 as 
a function of r for n = 10, 50, 100. Notice that Monte Carlo power estimate increases as r gets larger and 
then decreases, due to the magnitude of r and n. Because for small n and large r, the critical value is 
approximately 1 under H Q , as we get a complete digraph with high probability. Moreover, the more severe 
the segregation, the higher the power estimate at each r. Under mild segregation (with e = y/3/8), r around 
1.5 or 5 yields the highest power (for other r values, the power performance is very poor). Furthermore, 
under moderate to severe segregation, with n = 10 the power estimate seems to be close to 1 for r £ (1,4), 
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Figure 17: Kernel density estimates of the relative density of proportional-edge PCD, p PE (n,r), under the 
null (solid line) and the segregation alternatives (dashed line) with (left) and (right) for r = 3/2 

with n = 10 based on N mc = 10000 replicates. 




relative density relative density 



Figure 18: Depicted are kernel density estimates for p PE (n, 11/10) for n — 10 (left) and n = 100 (right) 
under the null (solid line) and segregation alternative H^. g (dashed line). 

and with n = 50 or 100 the power estimate seems to be close to 1 for r G (1, 5). However, the power estimates 
are valid only for r within (2,3), since the test has the desired size for this range of r values against the 
right-sided alternative. So, for small sample sizes, r ss 1.5 is recommended, and for larger sample sizes, 
moderate values of r (i.e., r G (2,3)) are recommended for the segregation alternative as they are more 
appropriate for normal approximation and they yield the desired significance level. 

In the multiple triangle case, we generate the X points uniformly in the support for the segregation 
alternatives in the triangles based on the 10 class y points given in Figure QTJ We use the parameters 
e G {a/3/8, V3/4, 2 y/3/7}. We compute the relative density based on the formula given in Corollarv l3.5l The 
corresponding empirical power estimates as a function of r (using the normal approximation) are presented 
in Figure [20] for n = 500 or 1000. Observe that the Monte Carlo power estimate increases as r gets larger 
and then decreases, as in the one triangle case. The empirical power is maximized for r G (1.5,2) under 
mild segregation, and for r G (1.5,3) under moderate to severe segregation. Considering the empirical size 
estimates, r w 1.5 is recommended under mild segregation, while r G (2,3) seems to be more appropriate 
(hence recommended for more severe segregation) since the corresponding test has the desired level with 
high power. 

6.2 Empirical Power Analysis for Central Similarity PCDs under the Segrega- 
tion Alternative 

In the one triangle case, data generation is as in Section [67X1 At each Monte Carlo replicate we compute 
the relative density of the central similarity PCDs. We consider r G {0.2, 0.4, 0.6, . . . , 3.0, 3.5, 4.0, . . . , 20.0} 
for the central similarity PCD. We repeat the simulation procedure N mc = 10000 times. Under segregation 
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Figure 19: Empirical power for Rpe{t) in the one triangle case: Monte Carlo power estimates for 
relative density of proportional-edge PCDs in the one triangle case using the asymptotic critical value against 
segregation alternatives H^, & (top row), H^.^ (middle row), and H^^. 7 (bottom row) as a function of 

r, for n = 10 (left column), n = 50 (middle column), and n = 100 (right column). 



alternatives with e > 0, the distribution of p cs (n, r) is non-degenerate for all r S (0, oo) and s € (0, \/3/3). 

In the one triangle case, we plot the kernel density estimates for the null case and the segregation 
alternative with e = \/3/8 and e = \/3/4 with n = 10 and N mc = 10000 in Fi gure 1211 Observe that under 
both H a and alternatives, kernel density estimates are almost symmetric for r = 1. Moreover, there is much 
more separation between the kernel density estimates of the null and alternatives for e = \/3/4 compared 
to e = a/3/8, implying more power for larger e values. In Figure [221 we present kernel density estimates for 
the null case and the segregation alternative H^i 4 for r = 0.5, and n = 10, N mc = 10000 (left), n = 100, 

N mc = 1000 (right). With n = 10, the null and alternative kernel density functions for p cg (10, 0.5) are very 
similar, implying small power. With n = 100, there is more separation between null and alternative kernel 
density functions, implying higher power. Notice also that the probability density functions are more skewed 
for n = 10, while approximate normality holds for n = 100. 

We estimate the empirical power as j^— J^j^i I (Rcs( T ){ T i j) > Zi-a)- In Figure[23j we present Monte 
Carlo power estimates for relative density of central similarity PCDs in the one triangle case against 

an< ^ "^2%/3/7 as a f unc ti° n 01 r f° r n = 10: 50, 100. Notice that Monte Carlo power estimate increases 
as r gets larger or n gets larger. Moreover, the more severe the segregation, the higher the power estimate 
at each r. With n = 10, the power estimates are high for r € (5, 14) and virtually for r > 14. With n = 50 
or 100, the power values are high for r > 1, with highest power being attained around r rs 8. However, for 
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Figure 20: Empirical power for Rpe{t) in the multiple triangle case: Monte Carlo power estimates for 
proportional-edge PCDs in the multiple triangle case using the asymptotic critical value against segregation 
alternatives (left column), (middle column), and (right column) as a function of r. for 

n = 500 (top) and n = 1000 (bottom). 
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Figure 21: Kernel density estimates of the relative density of central similarity PCD, p cs (n,r), under the 
null (solid line) and the segregation alternatives (dashed line) with (left) and (right) for r = 1 

with n = 10 based on N mc = 10000 replicates. 
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Figure 22: Depicted are kernel density estimates for p cs (n, 0.5) for n = 10 (left) and n = 100 (right) under 
the null (solid line) and segregation alternative H^. 4 (dashed line). 



t > 6, the power values are virtually same. Considering the empirical size estimates, we recommend t rj 8 
for mild segregation, and t « 5 for more severe segregation alternatives. 

In the multiple triangle case, data generation is again as in Section [6. II We compute the relative density 
based on the formula given in Corollary 13.51 The corresponding empirical power estimates as a function of 
t (using the normal approximation) are presented in in Figure [M] for n = 500 and 1000. Observe that the 
Monte Carlo power estimate tends to increase as r gets larger. Under mild segregation with e = V3/8, the 
empirical power is large for r > 2 with largest being around t € (4, 8). Under moderate to severe segregation, 
the empirical power is virtually one for r > 0.4. Considering the empirical size estimates, r ss 7 seems to 
be more appropriate (hence recommended for segregation) since the corresponding test has the desired level 
with highest power. 



6.3 Empirical Power Analysis for Proportional-Edge PCDs under the Associa- 
tion Alternative 

In the one triangle case, at each of N mc = 10000 Monte Carlo replicates under association H^, we generate 

iid ( \ 

Xi ~ U ( Tyg/3_ e J , for i = 1, 2, . . . , n for n = 10, 50, 100. The relative density is computed as in Section 

16.11 Unlike the segregation alternatives, the distribution of p PE (n, r) is non-degenerate for all e € (0, V3/3) 
and r € [l,oo). We consider e € {5 V3/24, n/3/21} (which correspond to 18.75 %, 75 %, and 

4500/49 w 91.84 % of the triangle being occupied around the y points by the X points, respectively) for the 
association alternatives. 

In the one triangle case, we plot the kernel density estimates for the null case and the association 
alternative with e = \/3/21 and e = 5 v3/24 with n = 10 and N mc = 10000 in Figure l25l Observe that 
under both H and alternatives, kernel density estimates are almost symmetric for r = 3/2. Moreover, 
there is more separation between the kernel density estimates of the null and alternatives for e = 5 \/3/24 
compared to e = v3/21, implying more power for larger e values. In Figure [2"ol we present kernel density 
estimates for the null and the association alternative ^^g^ 12 f° r r = 11/10, and n = 10, N mc = 10000 (left), 
n = 100, N mc = 1000 (right). With n = 10, the null and alternative kernel density functions for pi (ll/10) 
are very similar, implying small power. With n = 100, there is more separation between null and alternative 
kernel density functions implying higher power. Notice also that the probability density functions are more 
skewed for n = 10, while approximate normality holds for n = 100. 

Under association, for each r value, the level a asymptotic critical value is fj, PE (r)z a ■ y/v PE (r) /n. We 
estimate the empirical power as j^— ^2^=1 I {RpE(r)(r,j) < z a ). In Figure [27l we present Monte Carlo 
power estimates for relative density of proportional-edge PCDs in the one triangle case against -f^/j^' 
^V3/i2' an< ^ ^V3/2i as a mnc ti° n °f r f° r n = 10, 50, 100. Notice that Monte Carlo power estimate increases 
as r gets larger and then decreases, as in the segregation case. Because for small n and large r, the critical 
value is approximately one under H 0l as we get a nearly complete digraph with high probability. Moreover, 
the more severe the association, the higher the power estimate at each r. Highest power is attained for r « 2, 
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Figure 23: Empirical power for Rcs{ T ) m the one triangle case: Monte Carlo power estimates for 
relative density of central similarity PCDs in the one triangle case using the asymptotic critical value against 
segregation alternatives H^. g (top row), (middle row), and H^^. 7 (bottom row) as a function of 

t, for n = 10 (left column), n = 50 (middle column), and n = 100 (right column). 



which is recommended against the association, as it yields the desired level with high power. 

In the multiple triangle case, we generate the X points uniformly in the support for the association 
alternatives in the triangles based on the 10 class y points given in Figure [TT] We use the parameters e <G 
{5 \/3/24, y/3/12, \/3/21}. We compute the relative density based on the formula given in Corollary [23] The 
corresponding empirical power estimates as a function of r (using the normal approximation) are presented 
in Figure [28] for n = 500 or 1000. Observe that the Monte Carlo power estimate decreases as r gets larger 
unlike the the one triangle case. The empirical power is large (i.e., close to one) for r € (1,5). Considering 
the empirical size estimates, we recommend r 2 for association alternative since the corresponding test 
has the desired level with high power. 

6.4 Empirical Power Analysis for Central Similarity PCDs under the Associa- 
tion Alternative 

In the one triangle case, we generate data as in Section 16.31 and compute the relative density as in Section 
16.21 The distribution of p cs (n,r) is non-degenerate for all e € (0, v^/6) and r <G (0, oo). 

In the one triangle case, we plot the kernel density estimates for the null case and the association 
alternative with e = V3/12 and e = 5^3/24 with n = 10 and N mc = 10000 in Figure H3 Observe that 
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Figure 24: Empirical power for Rcs(i~) in the multiple triangle case: Monte Carlo power estimates for 
central similarity PCDs in the multiple triangle case using the asymptotic critical value against segregation 
alternatives (left column), (middle column), and H^^^ (right column) as a function of r, for 

n = 500 (top) and n = 1000 (bottom). 
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Figure 25: Kernel density estimates of the relative density of proportional-edge PCD, p PE (n, r), under the 
null (solid line) and the association alternatives (dashed line) with H^^ 21 (left) and H^^. 2i (right) for 

r = 3/2 with n = 10 based on N mc — 10000 replicates. 
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Figure 26: Depicted are kernel density estimates for p PE (n, 11/10) for n = 10 (left) and n = 100 (right) 
under the null (solid line) and association alternative H^/ 12 (dashed line). 

under both H a and alternatives, kernel density estimates are almost symmetric for r = 1. However, there is 
only mild separation between the kernel density estimates of the null and alternatives implying small power. 

In Figurc|30l we present a Monte Carlo investigation against the association alternative H^. 21 for r = 1, 

and n = 10, N mc = 10000 (left), n = 100, N mc = 1000 (right). With n = 10, the null and alternative kernel 
density functions for p cs (n, 1) are very similar, implying small power. With n = 100, there is more separation 
between null and alternative kernel density functions, implying higher power. 

Under association, we estimate the empirical power as in Section 16.31 In Figure 1311 we present Monte 
Carlo power estimates for relative density of central similarity PCDs in the one triangle case against H^^^i' 
^/3/i2' anc ^ ^\/3/2i as a mnc ti° n °f 7" for Ti = 10, 50, 100. Under mild association and small n, highest power 
is attained around r w 3, under mild association with large n, power increases as r increases. For moderate 
to severe association and large n, power is virtually one for all r values considered. Considering the empirical 
size performance, we recommend t«5, as it has the desired level and high power. 

In the multiple triangle case, we generate data as in Section 16.31 The corresponding empirical power 
estimates as a function of t are presented in Figure [32] for n = 500 or 1000. Observe that the Monte Carlo 
power estimate tends to decrease as r gets larger. The empirical power is maximized for r < 1. Considering 
the empirical size estimates, we recommend r pa 1 for association, since the corresponding test has the 
desired level with high power. 

Remark 6.1. Empirical Power Comparison for the Two PCD Families: In the one triangle case, 
under the segregation alternatives, the power estimates of the central similarity PCDs tend to be higher 
than those of the proportional-edge PCDs. Under mild to moderate association alternatives, central simi- 
larity PCDs have higher power estimates, while under severe association, proportional-edge PCD has higher 
power estimates. In the multiple triangle case, under segregation, central similarity PCDs has higher power 
estimates, and under association, proportional-edge PCDs has higher power estimates. □ 

7 Pitman Asymptotic Efficiency 

Suppose that the distribution F under consideration may be indexed by a set 6cK and consider H D : 9 = 9q 
versus H a : 9 > 8 . 

Pitman asymptotic efficiency or efficacy (PAE) provides for an investigation of "local asymptotic power" 
— local around H a . This involves the limit as n — > oo as well as the limit as e — > 0. 

Consider the comparison of test sequences S = {S n } satisfying the following conditions in a neighborhood 

8 E [9q,9o + k] of the null parameter for some k > 0. 

Pitman's Conditions: 

(PCI) For some functions jJL n (9) and <r n (9), the distribution Fg of [S n — fJ, n (&)] /cr n (6) converges to Z ~ A^(0, 1) 




Figure 27: Empirical power for Rpe{t) in the one triangle case: Monte Carlo power estimates for 
relative density of proportional-edge PCDs in the one triangle case using the asymptotic critical value against 
association alternatives H^^ 21 (top row). H^. 12 (middle row), and _ (bottom row) as a function of 

r, for n = 10 (left column), n = 50 (middle column), and n = 100 (right column). 
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Condition (PCI) is equivalent to 



0asti-> oo. 



(PCI)' For some functions n n (6) and QV t (0), th e distribution i*e of [£„ — /x n (0 n )] /a n (9 n ) converges to a standard 
normal distribution (see lEeden ( 19631 )). 
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Figure 28: Empirical power for Rpe{t) in the multiple triangle case: Monte Carlo power estimates 
of the relative density of proportional-edge PCDs in the multiple triangle case using the asymptotic critical 
value against association alternatives H^. 21 (left column), H^^ 12 (middle column), and H^-^.^ (right 

column) as a function of r, for n = 500 (top) and n = 1000 (bottom). 
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Figure 29: Kernel density estimates of the relative density of central similarity PCD, p cs (n,r), under the 
null (solid line) and the association alternatives (dashed line) with H^^ 12 (left) and H^^.^ 4 (right) for 

r = 1 with n = 10 based on N mc = 10000 replicates. 
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Figure 30: Depicted are kernel density estimates for p cs (n, 1) for n = 10 (left) and n = 100 (right) under 
the null (solid line) and association alternative ^^g^ 21 (dashed line). 

Note that if (i n k) (6» ) > and f$ (9 ) = 0, for alU = 1, 2, . . . , k- 1, then (i' n (9 Q ) in (PC2 ) (PC3), and (PC5) 
can b e replaced by Hn\Oo) > and (i' n (9 n ) in (PC3) can be replaced by (i n (9 n ) (see iKendall and Stuartl 
jl97Sj> ). 



Lemma 7.1. (Pitman-Noether) 

(i) Let S = {S n } satisfy conditions (PC1)-(PC5). Consider testing H Q by the critical regions S n ^> n cin 
with a„ = Pg a (S n > ii tt „) — > a as n — > oo where a <G (0,1). For (3 <G (0, 1 — a) and 0„ = 9q + (n~ x / 2 ) , 
we have 

I3n(0 n ) = Pe n {T n > u an ) -> /3 iffcV^(0 n - 6) -> $- x (l - a) - $" 1 ( ( 8). 

(mJ Let 5 = {SV1} and Q = {Qn} eacA satisfy conditions (PC1)-(PC5). Then the asymptotic relative 
efficiency of S relative to Q is given by ARE(S,Q) = (cs/cq) 2 . 

Thus, to evaluate ARE(S,Q) under the conditions (PC1)-(PC5), we need only calculate the quantities 
cs and cq, where 

cs = Inn —= j—r and cq = lim 



n->oo y/n ■ as n (0o) n^oo y/n ■ o-Q n (9 ) 

PAE(S) = c s is called the Pitman Asymptotic Efficiency (PAE) of the test based on S n . Using similar 
notation and terminology for Q n . 

ARE{s > Q) -pAE(Qy 

A detailed discussion of PAE can be found in Kendall and Stuart ( 19791 ) and Eedenl (1963). 



(/i (fc) (re=0)) 2 

Under segregation or association alternatives, the PAE of p PE (n, r) is given by PAE(r) = ^ ' , » ' 

where k is the minimum order of the derivative with respect to e for which (i^ k \r,e = 0) =/= 0. That is, 
p^(r,e = 0) ^ but p^(r,e = 0) = for I = 1, 2, . . . , k - 1. Similarly, the PAE of p cs (n,r) is given 

by PAE(t) = ^'^'(^ where k is the minimum order of the derivative with respect to e for which 

(i^(r, £ = 0)^0. 

7.1 Pitman Asymptotic Efficiency for Proportional-Edge PCDs under the Seg- 
regation Alternative 

Consider the test sequences p PE (r) = {p PE (n, r)} under segregation alternatives for sufficiently small e > 
and r G [l,V3/(4e)). In the PAE fr amework above, the parameters are 9 = e and 9q = 0. Suppose 
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Figure 31: Empirical power for Rcs{ T ) m the one triangle case: Monte Carlo power estimates for 
relative density of central similarity PCDs in the one triangle case using the asymptotic critical value against 
association alternatives H^^i row ): ^^3/12 ( m iddlc row), and H^^.^ (bottom row) as a function of 
t, for n = 10 (left column), n = 50 (middle column), and n = 100 (right column). 



H 8 PB (r,e) = Ef[p PE (n,r)]. For s e [0,^3/8), 



4* 



(r,e) = ^2w hj (r,e)I(r elj) 



3=1 



with the corresponding intervals X x = [1,3 /2 - V3e), X 2 = [3/ 2 - V3e,3/2), I 3 = [3/2,2 - 4e/\/3), 
I 4 = [2-ie/VE,2),X 5 = [2,V3/(2e)). See ICevhan et al.1 (<2004bh for the explicit form of fif E (r, e) and for 
derivation. Notice that as e -> 0, only X x = [l,3/2 - y/3e), I 3 = [3/2,2 - 4e/Vf), X 5 = [2,y/3/(2e)) do 
not vanish, so we only keep the components of fi pE (r, e) on these intervals. 

Furthermore, cr 2 s (n,e) = Varf (p PE (n, r)) = 2n } x) Varf [h 12 ] + J 



n (n — 1) pe 



(r,e), with v p Jr,e) 



Cov* [h\2, his]. The explicit forms of Var* [/112] and Cov^ [hi2, /113] are not calculated, since we only need 
lim„_ i . 00 Un(e = 0) = v PE (r) which is given in Equation (|9|). 



(Il978l) 



Notice that |/ii 2 | 3 < 8 < 00 and Ef [/i i2 /113] — Ef [/112] 2 = Covf [/ii 2 , /113] > then bv lCallaert and Janssen 



sup te i 



>,e) 



< i 



*(t) 



< CEf |^ 12 | 3 [v s pE 



power est. under H A 3 2| with n=500 power est. under H* with n=500 power est. under 4 with n=500 
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Figure 32: Empirical power for Rcs( T ) m the multiple triangle case: Monte Carlo power estimates 
of the relative density of central similarity PCDs in the multiple triangle case using the asymptotic critical 
value against association alternatives H^, 21 (left column), 2 (middle column), and 4 (right 

column) as a function of t, for n = 500 (top) and n = 1000 (bottom). 



where C is an absolute constant and <!>(•) is the standard normal distribution function. Then (PCI) follows 
for each r € [1,^3/(2 e)) and e € [0, a/3/4). 

Differentiating /i pE (r, e) with respect to e yields 



(A&J'fa e) = w' M (r, e)l(re [l, 3/2 - V3 e)J + < 3 (r, e) I (r e [3/2, 2 - 4 e/Vz)^ 

+ o7i i6 (r,e)l(re [2,V3/(2e))) 



where 



2 e (144 e 2 (r 2 - 1) + 36 - 37 r 2 ) 
27 (2e- l) 3 (2e + l) 3 : 



w' 13 (r,e)= 2v / 3((2r-3)64e 3 + (7i 



24 r + 20) 16 \/3e 2 + (r - 3) 48 e + 3 V% r 4 + 96 i 



9(2e+ l) 3 (2e- 1)V 

8^6 (48e 3 + (3 r 4 + 3r 2 - 20) 4\/3V + 366 + 9^3 -9\/3r 2 ) 
27r 2 (2e+l) 3 (2e- l) 3 



Since {u s „„)'{r,e = 0) = 0, we need higher order derivatives for (PC2). A detailed discussion is available 
in dKendall and Stuartl l(l979li ). 

Differentiating {n PB )' (r,e) with respect to e yields 



(4J'(r,s) = w'i x {r,e)l(r G [1,3/2- v/3e)) +^ , )3 (r j£ )I (r E [3/2,2 -U/Vs) 

+ < >6 (r,e)l(re [2,V3/(2e)) 



where 



^i,i( r > £ ) = - 



2 (r 2 - 1) 1728 e 4 + (72 - 77 r 2 ) 4e 2 + 36 - 37 r 2 



27(4e 2 -l) 4 

(2 r - 3) 512 V3 e 5 + (20 + r 4 + 7 r 2 - 24 r) 576 e 4 + (2 r - 3) 1024 VSs 3 + (20 - 108 r 2 



96r + 9r 4 )36e 2 + (-3 + 2r) 96 V3 e - 108 + 9 r 4 - 180 1 



288 r 



9r 2 (2e + l) 4 (2e- l) 4 



w'l 5 (r, e) = -8 [l28 Vis 5 + (-20 + 3 r 4 + 3 r 2 ) 48 e 4 + 256 V3 e 3 + (-5 - 12 r 2 + 3 r 4 ) 12 e 2 + 24e V3 + 9 



9r' 



/[9r 2 (2e + l) 4 (2e- l) 4 



Thus, 



8 , 74 2 
3 ' 27 



( M f V'(r )£ = 0) 



for re [1,3/2), 

2 (^-4r+2)(r-+4r-6) foj . r g [3/ 2> 2 ), 

for re [2,V5/(2e)). 



(19) 



8(i-r 2 ) 



Observe that (jtf E )"(r,e = 0) > for all r 6 [l, V3/(2e)), so (PC2) holds with the second derivative. (PC3) 
in the second derivative form follows from continuity of (p PE )" (r, e) in e and (PC4) follows from continuity 
of cr 2 (r, e) in e. 



Next, we hnd c?_ (r) = lim^oo £- E , 



, where numerator is given in Equation (11911 

and denominator is given in Equation (|9|). We can easily see that c pE {r) > 0, since c pE (r) is increasing in 
r and c^ B (r = 1) > 0. Then (PC5) follows. So under segregation alternatives Hf , the PAE of p PE (n,r) is 
given by 



PAEp B (r) 



((Q"(r, £ = 0))' 

" PE ( r ) 





Figure 33: Pitman asymptotic efficiency against segregation (left) and association (right) alternatives as a 
function of the expansion parameters in the one triangle case for the relative density of proportional-edge 
PCDs (solid line) and central similarity PCDs (dashed line). 

In Figure [33] (left), we present the PAE as a function of the expansion parameter for segregation. Notice 
that PAE^ E (r = 1) = 160/7 22.8571, lim,.^ PAE| B (r) = oo. Based on the PAE analysis, we suggest, for 
large n and small s, choosing r large for testing against segregation. However, for small and moderate values 
of n, normal approximation is not appropriate due to the skewness in the density of p PE (n,r). Therefore, 
for small n, we suggest moderate r values. 



7.2 Pitman Asymptotic Efficiency for Central Similarity PCDs under the Seg- 
regation Alternative 

Consider the test sequences P cs (t) = {p cs (n, r)| for sufficiently small e > and t e (0, oo). In the 
PAE framework above, the parameters are 9 = e and 8q = 0. Suppose, pf. At, e) = Ef [p cs (n, r)]. For 



g [0,V3/5), 



with the corresponding intervals Xi = (0, 1 — v^e), X 2 = [l— v^e, l), and I3 = [1, 00). See Appendix 2 for 
the derivation of /xf s (r, e) for r > 1 and Appendix 3 for the explicit form of /x^ (r, e) for r G (0, 1]. Notice 
that as e — ► 0, only X\ and Z3 do not vanish, so we only keep the components of \l s (r, e) on these intervals. 

Furthermore, a\{e) = Varf (p cs (n, r)) = 2w ( ^ t _ 1) Varf [/t 12 ] + „ ( "~-i) Cov f i h i2, ^13]- The explicit forms 
of Varf[/ii2] and Covf [/112, /113] are not calculated, since we only need lirrin^oc ct 2 (£ = 0) = i^ cs (r) which 
is given in Equation (j 12[) . 

Notice that Ef | 1 3 < 8 < 00 and Ef[h 12 h 13 ] - Ef [/112] 2 = Covf [/i 12) /113] > then (PCI) follows for 
each t g (0, 00) and e g [0, y/3/3). 

Differentiating xtf (r, e) with respect to £ yields 



where 



and 



(/xf s ) , (T, £ )=^ )1 (r )£ )l(rg (0,1-V3 £ )J + n / 1)3 (r,e)I(r g [l,oo)) 

ier 2 (5£ 2 r-9e 2 -3T + 3) 



ro 'i.i( T : £ ) = 



9(1 -2e)3(2e+ 1)3(1 - T ) 



8 r, 



ro' u (T,E) = - ^2r 5 £ 2 +21r 4 e 2 +116r 3 £ 2 +48%/3r 2 £ 3 +37T 2 £ 2 +96 V3r e 3 -18 t 3 +36 V3t 2 £-200t£ 2 +48 V3e 3 
45 r 2 + 72^3x6- 132 e 2 -36r + 36 VZe - 9) re] /[(2t+ 1) (r + 2) (r + l) 2 (2e+ l) 3 (2e- 1) S 



hence (/x* )'(r, £ = 0) = 0, so we need higher order derivatives for (PC2). Differentiating (/x£ )'(r, e) with 
respect to £, we get 

(/if s )"(r, e) = ro^r, e) I (r g (0, 1 - yfte)) + w'! tS (r, e) I (r g [1, 00)) 



where 



(20- 



36 j 



15r« 



Hi 



r + 1 



3(r-l)(2£-l) 4 (2£ + l) 4 



1 r 5 £ 4 +84 r 4 £ 4 +2 r 5 £ 2 +464 t 3 £ 4 + 128 V / 3r 2 £ 5 +21 r 4 £ 2 +148 rV+256 V3t £ 5 -4 r 3 £ 2 



256v^T 2 £ 3 -800r£ 4 + 128v / 3£ 5 -263r 2 £ 2 +512 v / 3r£ 3 -528£ 4 -6r 3 +24V3T 2 £-440r£ 2 +256v / 3£ 3 -15r 2 - 



48\/3t 



192 ( 



12 t + 24 V3e - 3)r] / [(2 r + 1) (r + 2) (r + l) 2 (2 e + l) 4 (2 e - i f 



Hence 



(/XM"(T,£=0) 



for t g (0, 1), 
for re[l,oo). 



(20) 



Observe that (/xf s )"(T,£ = 0) > for all r g (0, 00), so (PC2) holds with the second derivative. (PC3) 
in the second derivative form follows from continuity of (/xf s )"(r, e) in £ and (PC4) follows from continuity 
of cr 2 (r, e) in e. 



(C) "(T,e=0) 0C S ) "(T,e=0) . . . 

" — — — where numerator is given m Equation (|20l) 



Next, we find c^„(r) = lim„^ c 
and denominator is given in Equation (fT2"|) . We can easily see that cf (r) > 0, then (PC5) follows 



So under segregation alternatives Hf , the PAE of p cs (n,r) is given by 

((/xf s )"(r,e = 0)) 2 



PAE& s (r) 



In Figurc[33](left), we present the PAE as a function of r for segregation. Notice that lim r _>o PAE^. s (r) = 
320/7 « 45.7143, argsup re(0iOo) PAe£ s (t) = 1.0 with PAE^ s (t = 1) = 960/7 « 137.1429 and lim^oc PAe£ s (t) 
oo. Moreover a local maximum occurs at r = 1 and a local minimum occurs at r w 1.62 with PAE score 
w 112.70. Based on the PAE analysis, we suggest, for large n and small e, choosing r large for testing against 
segregation. However, for small and moderate values of n, normal approximation is not appropriate due to 
the skewness in the density of p cs (n, r) for extreme values of r. Therefore, for small n, we suggest moderate 
t values (i.e., r ps 7 or 8). 

Comparing the PAE scores of the relative density of proportional-edge PCDs and central similarity PCDs 
under segregation alternatives, we see that PAE PB (t) < PAE^ s (t) for 1 < t < 1.093); and PAEp B (t) > 
PAE^ s (i) for t > 1.093. Therefore, under segregation alternative, overall, relative density of proportional 
edge PCD is asymptotically more efficient compared to the central similarity PCD. Furthermore, PAEp B (t) 
tends to oo as t — y oo at rate 0(t 2 ), while PAE^. s (t) tends to oo as t — > oo at rate 0(t). 



7.3 Pitman Asymptotic Efficiency for Proportional-Edge PCDs under the As- 
sociation Alternative 

Consider the test sequences p PE {r) = {p PE (n,r)} for sufficiently small e > and r £ [l,oo). In the 
PAE framework above, the parameters are 8 = e and 8q = 0. Suppose, fi^ (r,e) = ~E e {p PE (n,r)}. For 
e £ [0, (7 v / 3-3v / 15) /12 « .042), 

6 

i=i 

with the corresponding intervals T\ = [l, (l + 2 V3e) / (l — V3e)), Xi = [(l + 2 V3e) / (l — V3s) , 
4 (1- V3e)/3),I 3 = [4 (l-V3e)/3,4 (l + 2 y/3e) /3), J 4 = [4 (l + 2^3 e) /3, 3/(2 (l-\/3e)))> J 5 = 
[3/(2 (l — \/3e)),2) and T§ = [2,oo). Notice that as e — > 0, only X, for j = 2, 4, 5 , 6 do not vanish, so we 
only keep the components of p PE (r, e) on these intervals. See Cevhan et al. (|2004bl ) for the explicit form of 
Pp E (r,e) and for derivation. 

Furthermore, o 2 n {e) = Var^(p PE (n, r)) = 2 „ ( 4 _ ^ Var^ [h 12 }+ „ ( "~_ 2 j) Cov^ [h 12 ,h 13 ] whose explicit form 
is not calculated, since we only need lim n _ ! . 0O y/nu n (e = 0) = v PE (r) which is given in Equation (|9|). 
(PCI) follows for each r £ [1, oo) and e £ [0, v^/3) as in the segregation case. 
Differentiating fj,^ E (r,e) with respect to e, we get 

(j4J(r> £ ) = <2(r, e) I(r £ [1, 4/3)) + w[ A (r, e) I(r £ [4/3, 3/2)) 

+ t S 7i iB (r > e)I(rG[3/2,2)) + t J 7i i6 (r > e)I(re[2,oo)) 

where 



m' 12 (r,e) 



V3I-H52: 



.4_3 



720 V3r 4 e 2 - 288 r 4 e + 11 V3r 4 + 2592 v^rV - 10368 V3re 2 



6e+V3) (6e+Vt\ r 2 



+ 432 V3r 2 +6480 v^e 2 - 864 + 432 Vsje 
™i,i( r > e ) = - 2 v / 3(-H52r 4 e 3 + 720 \/3r 4 £ 2 - 288 r 4 e + 11 V3 r 4 - 1296\/3r 2 e 2 + 108 V3r 2 

- 2160 Vie 2 - 144 Vs)e j (-6e + Vtj 3 (6e + Vtj 3 r 2 , 
2 e (3 r 4 - 72 r 2 - 240 e 2 + 192 r - 124) 



r 2 (12e 2 - l) 3 



40 e 



r 2 (12e 2 - l) 2 ' 

Hence (pj E Y(r,e = 0) = 0, so we differentiate (p PE )'(r, e) with respect to e and get 

(^J>, £ ) = < 2 (r, £ )I(r£ [1,4/3)) + < 4 (r, £ ) I(r £ [4/3,3/2)) 

+ ts7i / fB (r J e)I(r £ [3/2, 2)) + ^(r, e) I(r £ [2,oo)) 



where 



V3 (^-27648 r 4 e 5 + 25920 v^rV - 18432 r 4 e 3 + 2820 VSr 4 e 2 + 93312 v^rV 
- 576 r 4 e - 373248 V3re 4 + 11 V3 r 4 + 33696 V3r 2 e 2 + 233280 V3e 4 - 82944 re 2 
+ 432 V3 r 2 + 45360 Vie 2 - 864 %/3r + 432 v^) / ^e + v^) (*-6£ + V3 



^",4 (?",£) = - 6 



v/3 ( -27648 r 4 e 5 + 25920 v^^e 4 - 18432 r 4 e 3 + 2820 v^rV - 46656 V^rV 
2592 %/3 r 2 e 2 - 77760 V3e 4 + 108 V3r 2 - 15120 \/3e 2 - 144 Vs 



576r 4 e + l\Vlr A 



6e + V3) (-6e + V3 



< 5 (r, £ ) 
^1,6 ( r > £ ) 



2 (180 rV + 3 r 4 - 4320; 



.2 ,.2 



8640 e 4 + 11520 re 2 - 72 r 2 - 8160 e 2 + 192 r - 124) 



r 2 (12e 2 - l) 4 



Thus, 



40(36e 2 + l) 
r 2 (12e 2 -l) 3 ' 



(^J"(r, £ = 0) 



-f r 2 + 192 r" 1 - 96 r~ 2 - 96 for r£ [1,4/3), 

fr 2 +32r~ 2 -24 for re [4/3,3/2), 

6r 2 - 384r~ 1 + 248r~ 2 + 144 for re [3/2,2), 

40 r- 2 for r E \2,oo). 



(21) 



Note that (p- A E )"(r,e — 0) < for all r e [l,oo), so (PC2) follows with the second derivative. (PC3) and 
(PC4) follow from continuity of (p A )"(r, e) and cr 2 (r, e) in e. 



Next, we find c A (r) = lim ra _ 



•°° v/S(T n (r,E=0) 



, by substituting the numerator from Equation 



- PB v, -— ™ ^<r,(r,£=0) v ^ ( r ) ' 

(|2T|) and denominator from Equation (|9]). We can easily see that c A E (r) < 0, for all r > 1. Then (PC5) 
holds, so under association alternatives i?^ 1 , the PAE of p PE (n 7 r) is 



PAE^(r) = (c^(r)) 2 = 



((^j>, £ = o)y 



In Figure [33] (right), we present the PAE as a function of r for association. Notice that PAEp £ (r = 1) = 
174240/17 « 10249.41, lim,-**, PAE^ £ (r) = 0, argsup re[1>oo) PAE^(r) « 1.01 with supremum w 10399.77. 
PAEp B (r) has also a local supremum at n « 1.44 with local supremum « 3630.89. Based on the Pitman 
asymptotic efficiency analysis, we suggest, for large n and small e, choosing r small for testing against 
association. However, for small and moderate values of n, normal approximation is not appropriate due to 
the skewncss in the density of p PB (n,r). Therefore, for small n, we suggest moderate r values. 



7.4 Pitman Asymptotic Efficiency for Central Similarity PCDs under the As- 
sociation Alternative 



Consider the test sequences P cs {t) = {p as {n, t)} for sufficiently small e > and t € (0, oo). In the 
PAE framework above, the parameters are 9 = e and 9q = 0. Suppose, p A s (r, e) = Eg [p cs (n, t)]. For 
s e [0,73/21), 



with the corresponding intervals X\ = 



0, 



2 (l— v/3s) 



24 = 



3 V3e 



1-\/3e' 1-4 V3e 



2r = 



3_y/3j 



6 %/3s 



1-4 V3e' 1-\/3e 



2fi = 



X 2 = 

6 VSe 



3 V3£ 2 y^s 

2 (l--v/3e) ' 1-2 v^e 

6 V3 



1-V3e' 1-4 V3e 



,^7 



I 3 = 
6 \/3 



2\/3e 



3 ygj 



1-4 V3e 



1-2 v^e' l-y/3e, 

, 1 ), andJ 8 = [l,oo). 



Notice that as e — > 0, only the intervals Z7 and Is do not vanish, so we only keep the component of p A s (r, e) 
on these intervals. See Section [10] for the explicit form of fj,(r,e). 



w[JT,e) = -72 



Furthermore, cr^(e) = Var^(p cs (n, r)) = 2 n («-i) Var ^ foiz] + j) Cov^ [fe i2 , /ti 3 ] whose explicit form 
is not calculated, since we only need lim n _ ! . 0O y/na n (e = 0) = v cs (t) which is given Equation (fT2")l . 
(PCI) follows for each r € (0, oo) and e G [0, a/3/3) as in the segregation case. 
Differentiating fi^ (t, e) with respect to e, we get 

04)'(r, e) = ^i.rfr I (re (0, l)) + w' lfi {r, e) I (r G [1, oo)) 

where 

" /3(-360 tV+198 V3t 4 £ 2 -900 t 3 £ 3 -90 t 4 £+495 \/3r 3 e 2 -360 tV+4 V3t 4 -225 r 3 e+ 
126\/3T 2 e 2 + 10\/3r 3 -90r 2 £-387%/3T£ 2 +10%/3r 2 -126\/3e 2 )e|/[(2T+ 1) (r + 2) (-6 £ + Vz) 3 (e e + V3 
and 

, , , £ (69t£ 2 + 18e 2 -6t- 2) 

w < ( T e) = -24— i 4- 

(t + 2)(2t+1)(12£ 2 -1) 3 

Hence (Mcs)'( T ' £ = 0) = 0, so we differentiate (//^ )'(r, e) with respect to £ and get 

TO ;' 7 (r,£) = -216 [v / 3(-8640t 4 £ 5 +7128V3t 4 £ 4 -21600t 3 £ 5 -5760t 4 £ 3 +17820 v / 3t 3 £ 4 -8640 t 2 £ 5 +834 V3t 4 £ 2 - 
14400 t 3 £ 3 +4536%/3t 2 £ 4 -180t 4 £+2085v^t 3 £ 2 -5760t 2 £ 3 - 13932 \/3t£ 4 +4\/3t 4 -450t 3 £ + 978V3t 2 £ 2 - 
4536%/3£ 4 +10%/3r 3 -180r 2 £-1161\/3r£ 2 +10\/3T 2 -378V3£ 2 ) j (2 r + 1) (r + 2) (-6 £ + \ft\ ^ e + Vt 4 

and 

„ , . „ 2484t£ 4 + 648£ 4 -153t£ 2 -66£ 2 -6t-2 

zu"Jt,e) = 24 j . 

(r + 2)(2r+l)(12£ 2 -l) 4 

Thus 

[ (2r+l)(r+2) f ° r rG[l,Oo). 

Note that (/z4,)"(t, £ = 0) < for all r G (0, oo), so (PC2) follows with the second derivative. (PC3) and 
(PC4) follow from continuity of //'(t, e) and ct 2 (t, e) in £. 

(M^ 3 )"(r, E =0) = u^y 

VSff„(T,E=0) V^CS ( T ) 

>n (fi"2"l) . We can easily 
(PC5) holds, so under association alternatives Hf, the PAE of p cs {n 1 r) is 

, 2 



Next, we find c4j(t) = lim„_ i . 0O _ _j = ^^ T '^ ; by substituting the numerator from Equation 

22|) and denominator from Equation (fT2]l. We can easily see that c£ (t) < 0, for all t € (0,oo). Then 



PAE^(r) = (c^(r)) 



((^ s )"(r, £ = 0))' 



In Figurc[33] (right), we present the PAE as a function of r for association. Notice that lim T _j.o PAE cs (r) = 
72000/7 « 10285.71 which is also the global maximum, PAE£ s (t = 1) = 61440/7 « 8777.14 which is also a 
local maximum. Moreover, a local minimum of PAE^ s (t) occurs at r sa .45 with PAE score being equal to 
w 6191.67. Based on the Pitman asymptotic efficiency analysis, we suggest, for large n and small e, choosing 
r small for testing against association. However, for small and moderate values of n, normal approximation 
is not appropriate due to the skewness in the density of p cs (n, r). Therefore, for small n, we suggest r « 1. 

Comparing the PAE scores of the relative density of proportional-edge PCDs and central similarity PCDs 
under association alternatives, we see that PAE^(t) < PAE£ s (t) for 1 < t < 1.4564) and for t > 1.5192); 
and PAEp E (t) > PAEp S (£) for 1.4564 < t < 1.5192. Under association, relative density of central similarity 
PCD is asymptotically more efficient compared to the proportional-edge PCD. Furthermore, PAEp E (t) goes 
to as t — >• oo at rate 0(t~ 2 ), while PAE^ iS (t) goes to oo as t — > oo at rate 0(i _1 ). 



Remark 7.2. Hodges-Lehmann Asymptotic Efficiency: PAE analysis is local (around e = 0) and 
for arbitrarily large n. The comparison would hold in general provided that fj,(r, e) is convex in e for all 
s G [0, a/3/3). As an alternative, we fix an e under segregation alterna tive and then compare the asymptotic 



behavior of p PE (n,r) with Hodges-Lehmann asymptotic efficiency in (jCevhan et al.l (|2004bl )) 



Hodges-Lehmann asymptotic efficiency (HLAE) of p cs (n, r) is given by 

HLAE(t, e) := i^,e) - » s (t))\ 

Unlike PAE, HLAE does only involve n —> oo at a fixed e > 0. Hence HLAE requires the mean and, 
especially, the asymptotic variance of p cs (n, r) under a fixed alternative. So, one can investigate HLAE for 
specific values of e, if not for all e € (0, V3/3). □ 

Remark 7.3. The asymptotic power function allows investigation as a function of the expansion parameter, 
n, and e using the asymptotic critical value and a n appeal to normality. The asymptotic power functions of 
p PE (n,r) under the alternatives is investigated in ( Cevhan et al. ( 2004bl )V 



Under a specific segregation alternative , the asymptotic power function of p cs (n, r) is given by 

z (i- a ) ■ \J v cs (r) + v 7 ™ (p-cs 0) " Vcs ( T > £ )) 



n s (n,r,e) = 1 - $ 



Under , we have 



7.5 Pitman Asymptotic Efficiency Analysis in the Multiple Triangle Case 

For J m > f (i.e., m > 3), in addition to the expansion parameter, PAE analysis depends on the number 
of triangles as well as the relative sizes of the triangles (i.e., on y m ). So the optimal expansion parameter 
values with respect to the PAE criteria in the multiple triangle case might be different than that of the one 
triangle case. 

Given the values of J m and W, under segregation alternative Hf , the PAE for the relative density of 
proportional-edge PCDs is given by 

^^'- '^r 1 ' - — - — - — - 

PAE score for the relative density of proportional-edge PCDs under the association alternative is similar. 

Similarly, the PAE for the relative density of central similarity PCDs under segregation alternative 
is given by 

PAE g>„ T) =(fen 7 , r 0,)^ (" 4)> ' £ = ° )E ^) 2 „ (24) 

"°* (m ' T| ^ME/: 1 -| + 4feM 2 (ESr.» ) 3 -(Et 1 -»j) ) 

PAE score for the relative density of central similarity PCDs under the association alternative is similar. 

In Figure [M] (left), we present the PAE scores as a function the expansion parameter under segregation 
alternative conditional on the realization of y m given in Figure Iff! Notice that, unlike the one triangle 

case, PAE S (m,r) is bounded with lim^^ PAE S (m,r) = , — 8 Ej5 w 3 _ f39.34. Some val- 

peV ' ' r ^°° peV ' ' 256 (E/S %M£/=i ) 

ues of interest are PAE (m,r = 1) « .39, and a local maximum value of « 110.97 is attained at the 
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Figure 34: Pitman asymptotic efficiency against segregation (left) and association (right) alternatives as 
a function of expansion parameters in the multiple triangle case with J m = 13 for the relative density of 
proportional-edge PCDs (solid line) and central similarity PCDs (dashed line). Notice that vertical axes are 
differently scaled. 

argsup re j 1 2 j PAE^ E (m,r) ~ 1.97. On the other hand, the PAE curve for the central similarity PCDs in 
the multiple triangle case is similar that in the one triangle case (See Figure [33J (left)). But unlike the 
one triangle case, PAE^ s (to, t) is bounded with linv^oo PAE^ S (to, r) w 139.34. Some values of note are 
lim r ^o PAE^ S (to, r) 38.20, and a local maximum of « 100.77 is attained at argsup T£ ( 2 ) PAE^ 9 (m, r) 
1 : and a local minimum of w 75.97 is attained at arginf T6 / 13 \ PAE^ S (m, r) « 2.04. Based on the PAE 
analysis of the relative density of proportional-edge PCDs, under segregation alternative larger r values have 
larger asymptotic relative efficiency. However, due to the skewness of the pdf of p PE (m,r), moderate r 
values (r around 1.5 or 2) are recommended. As for the central similarity PCDs, larger r values have larger 
asymptotic relative efficiency. However, due to the skewness of the pdf of p os {m, t), moderate t values (r 
around 1) are recommended. 

Comparing the PAE scores for proportional-edge and central similarity PCDs under the segregation 
alternative, we sec that for 1 < t < 1.45 asymptotic relative efficiency of relative density of central similarity 
PCDs is larger since PAE^ S (to, t) > PAE^ B (m, t), and for t > 1.45 asymptotic relative efficiency of relative 
density of proportional-edge PCDs is larger since PAE~_ (to, t) < PAE^ E (m, t). Therefore, proportional-edge 
PCD tends to be more asymptotically efficient compared to the central similarity PCD under segregation. 

In Figure [34] (right), we present the PAE scores as a function the expansion parameter under association 
alternative conditional on the realization of y m given in Figure HT1 Notice that, as in the one triangle case, 
PAE A (to, r) tends to as r — > oo. Some values of interest are PAE A (m,r = 1) « 422.96, and a global 
maximum value of « 1855.97 is attained at r = 1.5 On the other hand, the PAE curve for the central 
similarity PCDs in the multiple triangle case is similar to the one in the one triangle case (See Figure [33] 
(left)). Note that lim r _> PAE^ s (to, r) « 8593.97; a local maximum value of w 6449.54 is attained at r = 1; 
and a local minimum value of « 5024.22 is attained at r w 0.49. Moreover, lirriT-^oo PAE^ s (to, t) = at 
rate 0(r -2 ). Based on the PAE analysis for relative density of proportional-edge PCDs, smaller r values 
tend to have larger asymptotic relative efficiency. However, we suggest, for large n and small e, choosing 
moderate r for testing against association due to the skewness of the density of p cs (n, r) for very small r 
values. 

Comparing the PAE scores for proportional-edge and central similarity PCDs under the association 
alternative, we see that for t > 1 asymptotic relative efficiency of relative density of central similarity 
PCDs is larger since PAE^ s (to,£) > PAE^ e (to,£). Therefore, central similarity PCD tends to be more 
asymptotically efficient compared to the proportional-edge PCD under association. 

Remark 7.4. Empirical Power Comparison versus PAE Comparison for the Two PCD Families: 

Notice that the finite sample performance (based on the Monte Carlo simulations) and the asymptotic 
efficiency (based on PAE scores) seem to give conflicting results. The reason for this is two fold: (i) in the 
Monte Carlo simulations, we only have a finite number of observations, and the asymptotic normality of the 
relative density of the PCDs require smaller sample sizes for moderate values of the expansion parameters, 
and (ii) PAE is designed for infinitesimal deviations from the null hypothesis (i.e., as close as possible to 
the null case), while in our simulations we use mild to severe but fixed levels of deviations. Hence, if we 



had extremely large samples, the results of our finite sample and asymptotic comparisons would agree under 
extremely mild segregation or association. 

Furthermore, when the PAE scores are compared at the optimal expansion parameters, the comparison 
results agree with that of the Monte Carlo simulation results. In particular, recall that in the one triangle 
case, the optimal parameters for proportional-edge PCDs were 1.5 and 2 and for central similarity PCDs, 
they were 8 and 5 against mild segregation and association, respectively. Under segregation, central similarity 
PCD is asymptotically more efficient, while under association proportional-edge PCD is asymptotically more 
efficient at these optimal parameters. This agrees with the conclusion of empirical power comparison. In 
the multiple triangle case, the optimal parameters for proportional-edge PCDs were 1.5 and 2 and for 
central similarity PCDs, they were 7 and 1 against mild segregation and association, respectively. Under 
both alternatives, central similarity PCD is asymptotically more efficient. In this case, only the segregation 
results are in agreement. The power estimates under association were virtually same at these optimal values 
with both PCD families. □ 

An extension of proportional-edge proximity regions and central similarity proximity reg ions to higher 



dimen sions (hence the correspo nding PCDs to data in higher dimensions) are provided in ICevhan et al 
( 20061) and lCevhan et all (|2007l) . respectively. 



8 Correction for X Points Outside the Convex Hull of y m 

Our null hypothesis in Equation ([TB")) is rather restrictive, in the sense that, it might not be realistic to 
assume the support of X being CnO^m) in practice. Up to now, our inference was restricted to the Ci/(3^ m ). 
However, crucial information from the data (hence power) might be lost, since a su bstantial proportion of X 
points, denoted 7r out , might fall outside the Cjj(y m ). A correction is suggested in ( Cevhan ( 2009bl [2010bl )') 



to mitigate the effect of 7r out (or restriction to the Cjj{y m )) on the use of the domination number for the 
proportional-edge PCDs. We propose a similar correction for the points outside the Cij(Xn) for the relative 
density in this article. 

Along this line, ICevhanl (|2009bl . | 2010bf ) estimated the 7r out values for independently generated X n and 



y m as random samples from U((0, 1) x (0,1)). The considered values were n = 100, 200, 900, 1000, 
2000, . . . , 9000, 10000 for each of to = 10, 20, ... , 50. The procedure is repeated N mc = 1000 times for each 
n, to combination. Let 7? ut be the estimate of the proportion of X points outside the Cff(iVm) which is 
obtained by averaging the 7r out value s (over n ) for ea ch m, n combination. The simulation results suggested 
that 7f out w 1.7932/m+ 1.2229/Vm (|Cevhanl (|2010bl) ). Notice that as to -> oo, 7r ou t -> 0. 



Based on the Monte Carlo simulation results, we propose a coefficient to adjust for the proportion of X 
points outside CVf(3^n), namely, 

Cch ■= signum(p ut - E[7?out]) x (p out - E[7r out ]) 2 (25) 

where signum(p ut — E[7r out ]) is the sign of the difference p out — E[7? out ] and p out is the observed and E[7r out ] ~ 
1.7932/to + 1.2229/\/to is the expected proportion of X points outside C/j (J^n, ) • For the test statistics in 
Section B~2l we suggest 

Rf E (r) := R PE (r) + \R PE (r)\ ■ C ch and R c c h s (r) := R cs (t) + \Rcs(t)\ ■ C ch (26) 

Note that this (convex hull) adjustment slightly affects the empirical size estimates under CSR of X and 
y points in the same rectangular supports, since p ou t and E[7r ou t] values would be very similar. On the 
other hand, under segregation alternatives, we expect Rp h E (r) value and p ou t — E[7r out ] to be positive, so the 
convex hull correction increases the value of i?pg(r) in favor of the right-sided alternative (i.e., segregation). 
Under association alternatives, we expect R c p E (r) value and p out — E[7? out ] to be negative, so the convex hull 
correction decreases the value of Rpe{t) in favor of the left-sided alternative (i.e., association). 



9 Example Data Set 



We ill ustrate the method on an ecological data set (namely, swamp tree data of lDixonl (l2002bl )). iGood and Whipple 
(|1982f ) considered the spatial patterns of tree species along the Savannah River, South Carolina, U.S.A. From 



this data. iDixon used a single 50m x 200m rectangular plot (denoted as the (0, 200) x (0, 50) rectan- 

gle) to illustrate his nearest neighbor contingency table (NNCT) methods. All live or dead trees with 4.5 cm 
or more dbh (diameter at breast height) were recorded together with their species. Hence it is an example 
of a realization of a marked multi-variate point patter n. The p l ot cont ains 13 different tree species, four of 
which comprising over 90 % of the 734 tree stems. See ICevhanl (|2009al ) for more detail on the data. 

In this article, we only consider the middle 50m x 55m rectangular plot from the original study area (i.e., 
the subset (95, 150) x (0, 50) of the 50m x 200m rectangular plot) and investigate the spatial interaction of 
all other tree species (i.e., other than bald cypress trees) with bald cypresses (i.e., bald cypresses are taken 
to be the y points, while all other trees are taken to be the X points; hence Delaunay triangulation is based 
on the locations of bald cypresses). The study area contains 8 bald cypress trees and 156 other trees. See 
also Figure 1551 which is suggestive of segregation of other trees from bald cypresses. 




Figure 35: The scatter plot of the locations of bald cypresses (circles o) and other trees (black squares ■) 
in the swamp tree data. The Delaunay triangulation is based on the locations of the bald cypresses. 

For this data, we find that 108 other trees are inside and 48 are outside of the convex hull of bald cypresses. 
Hence the proportion of other trees outside the convex hull of bald cypresses is p ou t = 0.3077 and the expected 
proportion is 7r out = 0.6515. Hence the convex hull correction decreases the magnitude of the raw test 
statistics. We calculate the standardized test statistics, Rpe(t), for r = 1, 11/10, 6/5, 4/3, y/2, 3/2, 2, 3, 5, 10 
values and, Rcs(t), for r = 0.2, 0.4, 0.6. . . . , 3.0, 3.5, 4.0, . . . , 20.0 values and the corresponding convex hull 
corrected versions. The p-values based on the normal approximation are presented in Figure 1361 Observe 
that with Rpe(t), the convex hull corrected version is not significant (for both the right- and the left-sided 
alternatives) at 0.05 level at any of the r values considered (only significant at 0.10 level at r between 1.4 
and 2.0 for the right-sided alternative), while the uncorrected version is significant (at 0.05 level) for r values 
between 1.4 and 2.0. On the other hand, with Rcs{t), the convex hull corrected version is significant (for 
the right-sided alternatives) at 0.05 level at r values between 0.2 and 4.0, while the uncorrected version is 
significant (at 0.05 level) for t values between 0.2 and 7. Hence, there is significant evidence for segregation 
of other trees from bald cypresses. 

We also perform a Monte Carlo randomization test as follows. First we calculate the standardized relative 
density values, denoted R P b E (r) and R'cs( T ) an d f° r the current data set, so they are observed test statistics. 
Then we randomly assign 8 of the trees as "bald cypresses" (without replacement) and the remaining trees 
as "the other trees", then calculate the test statistics (standardized relative density scores) for the other 
trees within the convex hull of the bald cypresses. We repeat this procedure 999 times. Combining the 
observed R P b E (r) and R^g(r) values with these Monte Carlo randomization test statistic values, we obtain 
1000 values. We sort these test statistics values and determine the ranks of the R P b E (r) and Rq§(t) values 
within the respective Monte Carlo randomized test statistic values. These ranks divided by 1000 (or 1000 
minus the rank divided by 1000) will yield the estimated p-values for the left-sided alternative (or the right- 
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Figure 36: The p-values based on proportional-edge PCDs (left) and central similarity PCDs (right) with 
convex hull corrected test statistics (circles connected with solid lines) and uncorrected test statistics (trian- 
gles connected with dashed lines). The horizontal lines are at and 0.05 values. Notice that the horizontal 
axes are differently scaled. 
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Figure 37: The p-values estimated by Monte Carlo randomization for proportional-edge PCDs (left) and 
central similarity PCDs (right) with convex hull corrected test statistics (circles connected with solid lines) 
and uncorrected test statistics (triangles connected with dashed lines). The horizontal lines are at and 
0.05 values. Notice that the horizontal axes are differently scaled. 



sided alternative). Here we also apply the convex hull correction as in Equation (|26[) by determining the 
proportion of other trees outside the convex hull of bald cypresses. Then we determine the estimated p- values 
for these convex hull corrected test statistic values as before. See Figure [37] for the p- values based on the 
Monte Carlo randomization tests. Observe that among the Monte Carlo randomized test statistics, none 
are significant at .05 level, but Rcs( T ) yields significant results at .10 level for some of the small r values. 
Notice the discrepancy between the significance in the original test (with the asymptotic normality) and the 
Monte Carlo randomization results. 



We also analyze the same data in a 2 x 2 NNCT with Dixon's overall test of segregation (jDixonl (|2002a[) ). 
See Table Q] for the corresponding NNCT and the percentages (observe that the row sum for live trees is 
157 instead of 156 due to ties in nearest neighbor (NN) distances). The cell percentages are relative to the 
row sums (i.e., number of other or bald cypress trees) and marginal percentages are relative to the overall 
sum. Notice that the table is not suggestive of segregation. Dixon's overall test statistic is Cd = 0.9735 
(p = 0.6146) and Ceyhan's test is Cjv = 0.1825 (p = 0.6692), both of which arc suggestive of no significant 
deviation from CSR independence. So, NNCT-analysis and our relative density approach seem to yield 
conflicting results about the spatial interaction of other trees with bald cypresses. However, NNCT and our 
relative density approach answer different questions. More specifically, NNCT-tests in this example tests the 
spatial interaction between the two tree groups, while the relative density approach only tests the spatial 
interaction of other trees with bald cypresses, but not vice versa. Furthermore, this situation is an example 
where relative density is more appropriate since there is much more other trees compared to bald cypresses. 



NN 



NN 



O.T. B.C. 


sum O.T. B.C. 




u O.T. 
baSe B.C. 


151 6 
8 


157 O.T. 
8 B.C. 


96 % 4 % 
100 % % 


95 % 
5 % 


sum 


159 6 


736 


96 % 4 % 


100 % 



Table 1: The NNCT for swamp tree data (left) and the corresponding percentages (right). O.T. stands for 
"other trees" and B.C. for "bald cypresses". 
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Figure 38: Ripley's bivariate L- function Li2(t) — t for the part of the swamp tree data we considered. Wide 
dashed lines are the upper and lower (pointwise) 95 % confidence bounds for the functions based on Monte 
Carlo simulations under the CSR independence pattern. B.C. = bald cypresses and O.T. = other trees. 



On the other hand, the NNCT tests are more appropr iate in t he case s wh ere the relative abundance of the 
two species are similar and cell sizes are larger than 5 ( Dixonl (|2002al ) and ICevharJ (|2010al )). 

To find out the level of interaction between the tree species at differen t scales (i.e., d istances between the 
trees), we also present the second-order analysis of the swamp tree data (iDiggld (2003)) using t he fun ctions 
(or some modified version of them) provided in spatstat package in R (jBaddelev and Turner! (|2005|)). We 
use Ripley's bivariate L-functions which are modified versions of his i-iT-functions. For a rectangular region 
to remove the bias in estimating K(t), it is recommended to use t values up to 1/4 of the smaller side length 
of the rectangle. So we take the values t £ [0, 12.5] in our analysis, since the rectangular region is 50 x 55 m. 

Ripley's bivariate L-function Lij(t) is symmetric in i and j in theory, that is, Lij(t) = Lji{i) for all 
In practice although edge corrections will render it slightly asymmetric, i.e., Lij{t) ^ Lu(t) for i ^ j. The 
corresponding estimates are pretty close in our example, so we only present one bivariate. Ripley's bivariate 
L-function for the bald cypresses and other trees are plotted in Figure [351 which suggests that bald cypresses 
and other trees are significantly segregated for distances about 0.5 to 7 meters, and do not significantly 
deviate from CSR for distances from 7 to 10 meters. 



10 Discussion 



In this article, we consider the asymptotic distribution of the relative density of two proximity catch di- 
graphs (PCDs), namely, proportional-edge PCDs and central similarity PCDs for testing bivariate spatial 
point patterns of segregation and association against complete spatial randomness (CSR). To our knowledge 
the PCD-based met h ods a r e the only graph the o retic tools for testin g spat ial point patter ns in literature 
( Cevhan and Priebd (|2005l) . ICevhan et all (|2006f ) . ICevhan et al.l (|2007l ). and lCevhanl (|2010bl) ). 

We first extend the expansion parameter of the central similarity PCD which was introduced in lCevhan and Priebe 
( 2003al ) and lCevhan et al.l (|2007l) to values higher than one. We demonstrate that the relative density of the 
PCDs can be expressed as [/-statistic of order 2 (in estimating the arc probability) and thereby prove the 



asymptotic normality of the relative density of the PCDs. For finite samples, we assess the empirical size and 
power of the relative density of the PCDs by extensive Monte Carlo simulations. For the proportional-edge 
PCDs, the optimal expansion parameters (in terms of appropriate empirical size and high power) are about 
1.5 under mild segregation and values in (2,3) under moderate to severe segregation; and about 2 under 
association. On the other hand, for central similarity PCDs, the optimal parameters are about 7 under 
segregation, and about 1 under association. Furthermore, we have shown that relative density of central 
similarity PCDs has better empirical size performance; and also, it has higher power against the segregation 
alternatives. On the other hand, relative density of proportional-edge PCDs has higher power against the 
association alternatives. 

We also compare the asymptotic relative efficiency of the relative densities of the two PCD families. Based 
on Pitman asymptotic efficiency, we have shown that in general the relative density of proportional-edge 
PCDs is asymptotically more efficient under segregation, while relative density of central similarity PCDs is 
more efficient under association. However, for the above optimal expansion parameter values (optimal with 
respect to empirical size and power), the asymptotic efficiency and empirical power analysis yields the same 
ordering in terms of performance. 

Let the two samples of sizes n and to be from classes X and y, respectively, with X points being used 
as the vertices of the PCDs and y points being used in the construction of Delaunay triangulation. The 
null hypothesis is assumed to be CSR of X points, i.e., the uniformness of X points in the convex hull of y 
points, Cutym). Although we have two classes here, the null pattern is not the CSR independence, since 
for finite m, we condition on to and the locations of the y points (assumed to have no more than three 
co-circular points) are irrelevant. That is, the y points can result from any pattern that results in a unique 
Delaunay triangulation. The relative density of the two PCD families lend themselves for spatial pattern 
testing conveniently, because of the geometry invariance property for uniform data on Delaunay triangles. 

For the relative density approach to be appropriate, the size of X points (i.e., n) should be much larger 
compared to size of y points (i.e., to). This implies that n tends to infinity while to is assumed to be fixed. 
That is, the imbalance in the relative abundance of the two classes should be large for our method to be 
appropriate. Such an imbalance usually confounds the results of other spatial interaction tests. Furthermore, 
by construction our method uses only the X points in CnO^m) which might cause substantial data (hence 
information) loss. To mitigate this, we propose a correction for the proportion of X points outside Cjj^m), 
because the pattern inside ChO?™) might not be the same as the pattern outside Cu(y m ). We suggest 
a two-stage analysis with our relative density approach: (i) analysis for Cjj{y m )> which provides inference 
restricted to X points in Cii(y m ) i (ii) overall analysis with convex hull correction (i.e., for all X points with 
respect to y m )- We recommend the use of normal approximation if n 10 to or more, although Monte Carlo 
simulations suggest smaller n might also work fine. 

There arc many possible types of parametcrizations for the alternatives. The particular parametrization 
of the alternatives in Equation (|15j) is chosen so that the distribution of the relative density under the 
alternatives would be geometry invariant also (i.e., independent of the geometry of the support triangles). 
The more natural alternatives (i.e., the alternatives that are more likely to be found in practice) can be 
similar to or might be approximated by our parametrization. Because under a segregation alternative, the 
X points will tend to be further away from y points and under an association alternative X points will tend 
to cluster around the y points. Such patterns can be detected by the test statistics based on the relative 
density, since under segregation (whether it is parametrized as in Section U or not) we expect them to be 
larger, and under association (regardless of the parametrization) they tend to be smaller. 
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APPENDIX 



Appendix 1: Derivation of fJ> cs (r) and v cs {t) for r > 1 

Let Mc be the center of mass of the standard equilateral triangle T e . By symmetry M cs (t) = P(X 2 <G 
7V C s(Xi,r)) = 6P(X 2 G Ncs(Xi,t), Xi G T(yi, M 3 , M c )) . To calculate this mean, we need to find the 
possible types of Ncs{x\,t) for r > 1. There are three cases regarding Ncs(xi,t). See Figure 1591 for the 
prototypes of these three cases of Ncs(xi, T ) for Xi = (iti,«i) G TO^)- 

Each case j, corresponds to region Rj in Figure l40l and the bounding lines are r§(x), i a m(x), and r?(x) 
with £ am (x) = x/y/3, r 6 (x) = ^, r 7 (x) = - ^"3+"° and s 2 = 

The explicit forms of Rj, j = 1, 2, 3 are as follows: 

Ri = {{x,y) G [0,1/2] x [0,r 6 (x)]}, 

R 2 = {(x,y) G [0,s 2 ] x [r 6 (x),£ am (x)] U [s 2 , 1/2] x [r 6 (x), r 7 (x)}}, 
R3 = {{x,y) G [s 2 ,l/2] x [r 7 (x),4m0)]}. 



P(* 2 G NcsiX^r), X 1 G T s ) = J]p(X 2 G NosiX^r), X 1 G P, 

For xi G 

1/2 r 6( ^(%(4^))^_ ^ 



/■ 7 r 

P(X 2 G Ncs(X u t), X 1 eR 1 ) = 

Jo Jo 



A(T(y 3 ))2 a 12(l + 2r) 



where A(JV cs (X 1} r)) = ^(l + 2r) 2 Vl 2 
For xi G i?2, 



Jr 6{x) J S2 Jr B ( x) J A(T(y 3 )f 2(l + 2r)(2 + r)' 

where A {N CS (Xi,t)) = & (y 1 +2rv 1 + V3«i) 2 . 
For X\ G i?3, 

A,, ^r 7 (x) A(T(y 3 ))^ 3(l + 2r)(2 + r) 

where A (NcsiX^r)) = & 

So P(X 2 G NcsiX^r)) = 2{ Ht^ +T y 

Next, we find the asymptotic variance term. Let 

P™ :=P({X 2 ,X 3 }cNcs(X u t)), P™ :=P({X 2 ,X 3 }cT G 1 S (X 1 ,t)) and 
P™ := P(X 2 € N CS (X 1i t),X 3 g ri S (^i,r)). 

where r x S (a;, r) is the Ti-reqion of x based on Ncs(', T ) and defined as r^ S (a;,r) := {y G 7(3^) : % C 
^cs(y,T)}. See lCevhan and Priebd (l2003bh for more detail. 

Then Cov[/ii 2 , /113] = E[/i i2 /ii 3 ] - E[/ii 2 ]E[/ii 3 ] where 

Epna/iis] = P({X 2 ,X 3 }ciVcs(^i,T))+2P(X 2 eAr C 5(^i,T),X3Gri S (X 1 ,T)) 
+P({X 2) X 3 } c rf* (*,,-)) = P 2 f + 2P C ^ + J** 

Hence «/ os (r) = Cov^ia.M = (P c 2 f +2P^ + P^) _ [2^ cs {r)f. 



Figure 39: The prototypes of the three cases of Ncs(xi,t) for x\ € T(yi, M3, Mc) with r = 2.5. 




To find the covariance, we need to find the possible types of T^ S (xi,t) and Ncsi^i, t) for r e (l,oo). 
There are three cases regarding Ncs{xi,t) and one case for T 1 (x u t). See Fi gurc 00] for the prototype of 
this one case of T 2 (xi, r) for x\ = (ui, vi) £ T(y 3 ), the explicit forms of C,j{r, x) arc 



(s/Zvi + 3ui - 3x) (—\fiivi + 3ui - 3x) 

(i(t,x) = ± j=- — S Q 2 {t,x)=-> 



V3(1 + 2t) ' \/3(1 + 2t) 

. . (3ui + 3r - 3rx - 3a; — a/^wi) A , . —a/3 r + a/3 r a; — 2 «i 

C3(T, x) = = -, CAt,x) = . 

V ' V3(-l + r) 2 + t 

v / 3r a; + 2z; 1 _ (-3 x - 3 r s + 3 u, + ^3 m) _ Vl 

<,5\. T i x ) — r— , i,6[ T T x ) — 7=—_ ; : j S7l r , a; J 



2 + r ' V3(l-r) ' *' v ' ' 1-r - 

By symmetry = 6P({X 2 ,X 3 } C Wcs^t), € T s ), and 

3 

P({X 2 ,X 3 } c 2V C s(*i,t), X 1 G T s ) = ^P({X 2 ,X 3 } C N CS {X u t), X 1 g Rj). 

3=1 

The limits of integration are as in P(X 2 € Ncs(Xi,t), Xi <G T s ) with the integrand being j4 ( jVcs ( Jl ' r )) 



Hence P 2W - 10T "' 9r + 2 
nence, r- cs — 5 (2 T+ i)( r+ 2)- 



Next, by symmetry = 6P({X 2 ,X 3 } C r^C^r), X x G T s ), and 

r, In. t)1 t 2 (10t 2 -5t + i) 



ao 



where A[T 1 ( Sl , T )J = — > 2(2t+ i )(t+2) -• 



p2G _ 2t 2 (10t 2 -5t+i) 
150 r cs - 5(2r+l) 2 (r+2) 2 ' 

Furthermore, by symmetry, = 6P(X 2 G N C s(Xi,r), X 3 G ^(J^r), G T s ), and 

3 

P{X 2 G Nos&ut), X z g I\ S (Xi,t), A'x g T.) = J2 P ( Xa e Nas{X U T), X 3 G rf S (Jfi,T), Xi G ii- 

J=l 

where P(X 2 G Ncs(Xi,t), X 3 G (Xl,t), .Xi G Rj) can be calculated with the same region of 
integration with integrand being replaced by 



M _ t 2 (10-54t-99t 2 +388t 3 + 1062 t 4 +720t 5 + 160-t 6 ) 
ihen F cs ~ 10(2t+1) 4 (t+2) 4 • 

Hence 

„ r , , . 2(160t 8 - 265t 3 -135t 4 + 751t 5 -47t 2 + 1373t 6 + 804t 7 + 24t 

E [ha h 13 ] = — a j 

5 (2r + l) 4 (r + 2) 4 

Therefore, 

168 r 7 + 886 t 6 + 1122 r 5 + 45 t 4 - 470 r 3 - 114 r 2 + 48 t + 16 
" CS(T) " 5(2r+l) 4 (r + 2) 4 " 



For t = oo, it is trivial to see that v cs (t) = 0. 



Appendix 2: Derivation of fi^ (r,e) for r > 1 



We pick the interval e G [0, V3/o) for a demonstrative example in derivation of //^ (r, e). For r G [l — 
\/3e, l), there are seven cases to consider for the form of Ncs{-,t)(xi,e), See Figure BTI for the prototypes 
of these seven cases of Ncs( , ,t)(xi,e). 

Each case j, corresponds to the region Rj in Figure [42j where q\{x) = —\/3x + 2e, 920*0 = 

Wl*±2e g3(a;) = V3x t 2 E -^ i ?4 ( l) = 1 |^ an d Sl ^ E /2, S2 = 2^e/3, S 3 = iff-y . 

, 7 are as follows: 



1-4t 

The explicit forms of Rj , j = 1 , 2 , 



[s 3 , 1/2] x [0,n (*)]}, 

[s 2 ,s 3 ] x [£i 6 (a;),r 6 (x)] U [s 3 ,s 7 ] x [n(x), r 6 (x)] U [s 7 , 1/2] x [n(a:), r 2 (a;)]}, 
[s 7 ,l/2] x [r 2 (x),r 3 (x)}}, 
[s 7 ,l/2] x [r 3 (x),r 6 (x)]}, 

[si,s 2 ] x [^i 6 (x),£ am (x)] U [s 2 ,s 5 ] x [r 6 (a;),€ Qm (x)] U [s 5 ,s 7 ] x [r 6 (x), r 2 (x)} } , 
[S5,se] x [r 2 (i),4mW]U [s 6 ,s 7 ] x [r 2 (x), r 7 (x)] U [s 7 ,l/2] x [r 6 (x), r 7 (x)]}, 
[s 6 ,l/2] x [r 7 (x),£ am (x)]}. 



By symmetry, P(X 2 G JVasr(*i,r,e)) = 6P(X 2 G JVe7s(*i, r, e), Xi G T s 
For xi G i?i, 



Pi 




i?2 


= {(x,y) e 


#3 


= (foy) G 


i?4 


= {(x,y) G 


P 5 


= {(s,y) G 


P(> 


= {(x,y) G 


Pr 


= {(x,y) g 



P(X 2 G Ar C s(Xi,r,e), X 1 G = 
where A {N CS {Xr, r, e)) = (1 + 2 r) 2 « 2 



1/2 yr!(x) ^(JV cs (Xi,r,e)) 2 _ 9 + 256 e 4 - 48 £^4- 288 e 2 - 256 e 3 ^ 



108 (1 + 2t) (4e 2 _i) 2 



For x\ e R-2, 

P (x 2 e N CS{Xl ,r, s) , Xl en 2) J r r w + r r w + r r w ) A{N ^: ))2 ^ = 

\J 8 2 Je 16 (x) J s 3 J ri (x) J s 7 Jr 1 (x) ) ^K 1 (^3 ) ) 

[4e^3\/3-27e + 9\/3r- 16 e 3 + 24e 2 V3 - 24 eV - 48e 3 r - 54 eV - 27er 3 - 81 re - 81er 2 + 

9 V3t 2 + 3 VSt 3 + 72 e 2 V3t + 72 e 2 \/3r 2 + 24 £ 2 \/3t 3 )] / [27 (1 + r) 3 (1 + 2 r) (4 e 2 - l) 2 ] . 

where A(N C s(Xi,T,e)) = (-2e + ui + 2rui + \/3wi) (2e - V3ui + 3i>i + 6rui). 
For xi € R3, 

-1/2 A(N CS (Xi,T,e)) 2 ^ Jr __ 2e 2 (9+ 18e 2 - 16e^3) 

'37 •/ T2 (x) 



P(X 2 e X, e *) = / / ^ XpG&r ^ = 27 (1 + 2.) (4^ I) 2 ' 



where 

A(Ncs{Xx,T,e)) = ^| (4£i/3 - 3 + 4\/3r-ui + 6ui + 2 V^wi - 8e 2 - Wstvi + 2v\ - 8 e Wi + 8 r v 2 + 8 r 2 u 2 - 6u?) 
For ii £ i?4, 

P(Y C N (Y \ v c R ^ /- 1/2 r' ^fetXi.v))' , 2e 2 (9-8 £ y^8 £ 2 ) 

P(X 2 G Ncs{Xx,T,e), Xi G -R 4 ) = / / JnrTv W dydx = - 



/„(.) ^(r(^3)) 2 ^ 27(l + 2r)(4 £ 2_i) 2 ' 
where 

j4 (iVcs f-X"i, r, e)) = f4e \/3 — 3 + 6 V3t vi + 3tti + 3 a/3-ui — 6e 2 — 16 e r ui — u 2 — 8e vi — 4tv 2 — 4tV — 3 it 2 
6 V 

For xi e R5, 

V«l -^leC*) Js 2 Jr 6 (x) Js 5 Jr 6 {x) ) A \ l (^3 ) ) 

[27r 4 -4e 4 r 7 -46£ 4 r 6 -130e 4 r 5 -72£^3r 4 + 144e 2 r 4 -74£ 4 r 4 + 288£ 2 r 3 + 54r 3 + 142 eV- 144 e ^r 3 + 112 rV- 
54 r + 144 r e \/3 - 288 e 2 r - 27 + 72 e \/3 - 144 e 2 ] / [54 (4 e 2 - l) 2 (1 + 2 r) (2 + r) (1 + r) 3 ] , 

where A(N C s(Xi,t,s)) = *f (-2e + uj. + 2rU! + V3«i) (V3ui + «i +2ti>i +2e). 
For xi e Rq, 

P (x, e N^ur.s), x, J r /'™ w + r r + r r w ) A( ^,y»V - 

yJs 5 Jr 2 (i) Js 6 Jr 2 (i) Js 7 Jr 6 (i) / A l J {^3 ) ) 

[4 (l0e 3 r + 9\/3r-27r £ - 8e 2 \/3r- 10 e 3 - 9 V3 + 27 e + 8 e 2 Vfj e]/[27 (2 + r) (1 + 2r) (4e 2 - l) 2 ] , 
where 

J 4(iVcs(Xi,r,e)) = -^-( 4 ^i -8ei)i+8^3ri;i -16ert)i +12ui -8e\/3iii -6 + 8e^3- 12 £ 2 — 17 2 — 4 t f 2 — 

2 *\/3ui vi — 4 v^ui r ui — 3 it 2 — 4 r 2 u 2 



For xi e R7, 

» r / v s ^ n ^ Z 1172 f r6(x) A(N C s(Xi,T,e)) 2 , , 1-2t + t 2 

P(X 2 G Nos(Xi,t,e), Xi G Rr) = / / 17^7^ d ^ da: " 



/ S7 A 3 W ^TOa)) 2 a 3(l-4 e 2)(i + 2 r)(2 + r)' 

where A (Ncs(X u r, e)) = ^ (2 e + 1) (1 - 2 e). 
So 

P(X 2 G Ncs(X!,T,e)) = [(36r 3 -8eV-84eV-144£ 2 r 3 -176e 4 r 3 -192r 2 \/3e 3 + 63r 2 + 140r 2 e 4 -144e 2 r 2 
384 \/3r e 3 + 18 r + 512 e 4 r + 144 e 2 r - 9 - 192 e 3 V3 + 144 e 2 + 240 £ 4 ) r] / [l8 ( 1 + rf (2 + r ) (1 + 2 r ) (4 e 2 - l) 2 ] . 



Appendix 3: The Mean ^ s (t, e) under the Segregation and Associ- 
ation Alternatives 



Derivation of /i^ s (r, e) involves detailed geometric calculations and partitioning of the space of (r, e, Xi) for 
t G [0, 1], e G [0, V3/3), and x\ G T s = T(yi, M 3 , Mq)- A demonstrative calculation is given in Appendix 2 
for e G [0,73/5) and r G [1 - V3e,l]. 

Pes ( r ' e ) un( ier the Segregation Alternatives 

Under segregation, we compute /z^ (r, e) explicitly. For £ G [0,\/3/5), 

2 

^cs ( T > £ ) = H ro l J ( T > £ ) I ( T G ^ ) 
i=i 

where 

(20£ 4 t-36£ 4 -3t + 3)t 2 



wi ,i\T,e) 



tui j2 (r, e) 



18(1-t)(2£ + 1) 2 (2£-1) 2 ' 



80 £ 4 r 3 + 16 £ 4 t 2 - 108 £ 4 r - 144 £ 4 + 192 V3e 3 + 288 £ 2 t + 64 \/3 e r 2 + 4 r 3 
- 288 e 2 - 128 VS e r - 60 r 2 + 64 \/3 e + 45 t - 16^t / [18 (2 r + 1) 2 (2 £ - 1) 2 (2 e + l) 2 ] , 

with the corresponding intervals I\ = [l, 1 — \/3e) an d 1% = [l — \/3~£, l). 
For £ G [x/3/5, a/3/4), 

3 

Mcs ( T ' £ ) = 51 1172 j (r, £) I(r G Zj ) 
i=i 

where W2,j{r, e) = zuij(t, e) for j = 1, 2, and for j = 3, 

vJ2fi{r,e) = [(^-96£V 4 -1584e 4 r 3 -3688e 4 r 2 +512v / 3e 3 r 3 -3636e 4 r+2304v / 3e 3 T 2 -1152£ 4 +3840v / 3e 3 r-576£ 2 r 2 
128 \/3 e r 3 + 8 t 4 + 1536 \/3e 3 - 4320 e 2 r - 320 \/3 e r 2 - 124 r 3 - 2304 e 2 + 640 V3 e r + 150 r 2 + 512 V3 e- 

77 t- 128)t]/[i8(2t + l) 2 (2e- l) 2 (2e + 1) 2 (1 - 2r)], 

with the corresponding intervals T\ = [l, 1 — \f2>£) , Ii = [l — v3£, V3/(2e) — 3/2), andZ 3 = [\/3/(2£) — 
3/2,1). 

For £ G [\/3/4, 2 \/3/7) , fj, s cs (r, e) = Y* =1 w 3 j(r, e) I(r G 2,-) where 

w 3 ,i(t, e) = [(984 e 4 r 3 -3452 £ 4 r 2 -1024 v / 3~£ 3 t 3 +4992 e 4 r+3584 v^eV+1152 £ 2 r 3 -2268 £ 4 -5120 v / 3£ 3 r-4032 e 2 r 
192 \/3 £ r 3 + 2304 v^e 3 + 5760 e 2 r + 672 \/3 £ r 2 + 42 r 3 - 2592 e 2 - 960 VSe r - 141 r 2 + 432 \/3 £ + 192 r - 84)r 2 ] / 

[32 (r - l) 2 (^3 £ - \/3~) * (2 r - 1)] , 



^3: 



3,2 (t, s) = [(3936 £ 4 r 6 -9872 £ 4 r 5 -4096 \/3e 3 T 6 +7144 £ 4 r 4 +10240 %/3eV+4608 £ 2 r 6 +7444 £ 4 r 3 -7168 \/3e 3 T 4 - 
11520 £ 2 r 5 - 768 \/3 £ r 6 - 4368 £ 4 r 2 - 8064 V3e 3 t 3 + 7488 £ 2 r 4 + 1792 \/3 £ r 5 + 136 r 6 - 1836 £ 4 r + 5056 v / 3eV+ 
10656 £ 2 t 3 - 768 VS £ r 4 - 220 r 5 - 144 £ 4 + 1536 V3e 3 r - 7200 £ 2 t 2 - 2464 \/3 £ r 3 - 154 r 4 + 192 VSe 3 - 1152 £ 2 r+ 

1664v / 3£r 2 + 771r 3 -288£ 2 +48\/3£r-464r 2 + 64\/3£ + 28r-16)r] / /[32 (3 £ - \/3") 4 (2 r- 1)(2 r+l) 2 (r- l) 2 j , 



^3,3(t, e) = [(2096 e 4 r 5 -3376 eV-2048 \/3£ 3 r 5 + 2204 e 4 r 3 + 3840 v / 3W + 2304 eV+3864 eV-2304 v^eV- 
4608£ 2 t 4 -384v^£t 5 -1332£ 4 t-4864v^£V+2016£ 2 t 3 +704v^£t 4 +68^ 

76r 4 + 1536\/3£ 3 -1728£ 2 r-1440^3£r 2 -115r 3 -2304e 2 + 208^3er + 328r 2 + 512^3e + 4r-128)rJ/[32 (r-1) 2 

(-3e + ^3) 4 (2r + l) 2 ], 



ros,4(r, s) = [(2064 eV - 1696 e 4 r 5 - 2048 \/3e 3 t 6 - 3292 e 4 r 4 + 1920 v^eV + 2304 e 2 -r 6 + 788 e 4 r 3 + 2816 V3s 3 t 4 - 

2304e 2 r 5 -384\/3£r 6 -3948£ 4 r 2 -2528%/3£ 3 T 3 -3168£ 2 r 4 +320\/3er 5 +68 r 6 -5940 e 4 r +3872 %/3e 3 r 2 +4896 £ 2 r 3 + 

672 V3 e r 4 -8 r 5 -1800 £ 4 +7392 \/3e 3 r-3600 £ 2 t 2 -1088 \/3 s r 3 -191 r 4 +2400 \/3e 3 -10080 e 2 r+304 \/3 s r 2 +213 r 3 -3600 e 2 

+ 1968\/3er + 44r 2 + 800 ^3 £ - 412 r - 200)r] / [32 (2 r + l) 2 (-3e + Vsj (r + 1)(t - l) 2 ] , 



W3,5(r,e) = [(1032 e 4 r 5 + 3280£ 4 r 4 -1024^3e 3 r 5 + 2186e 4 r 3 -3136^3e 3 r 4 + 1152e 2 r 5 -1806e 4 r 2 - 1920 \/3e 3 r 3 + 
3456 eV - 192 Vs £ r 5 - 2376 £ 4 r + 2384 \/3e 3 r 2 + 2448 sV - 576 V3 e r 4 + 36 r 5 - 648 £ 4 + 3024 V3e 3 T - 2736 £ 2 r 2 - 
624v / 3~£t 3 + 108 r 4 + 864 ^3e 3 -4104 e 2 r+264 ^3 e r 2 + 207 r 3 - 1296 £ 2 + 768 ^3 e r+54 r 2 + 288 v^e- 144 t- 72)t] / 

[l6(2r + l) 2 (3e- v / 3)V + 1)(t + 2)], 

with the corresponding intervals Ii = [l,l — \/3e), I2 = [l — v3e, V3/(2e) — 3/2), I 3 = [\/3/(2e) — 
3/2,\/3/£-3), I 4 = [\/3/e-3,4 (l - V3e)), and 2s = [4 (l - VSe) ,l). 

For e e [2^3/7,^/3), 

2 

i=l 

where W4j(t, e) = ro3.j(r, e) for j = 1, . . . , 5, and for j = 6, 7, 8 

074,6(7-, s) = [-254 eV - 1998 e 4 r 3 + 256 v / 3e 3 r 4 - 4752 e 4 r 2 + 2160 v / 3e 3 r 3 - 288 eV - 4320 e 4 r + 5328 v / 3e 3 r 2 - 
2592 e 2 r 3 +48 v^e r 4 -1296 e 4 +5184 v / 3£ 3 r-6552 e 2 r 2 +456 s r 3 -9 r 4 +1728 \/3£ 3 -6912 £ 2 r+1152 Vs £ r 2 -90 r 3 - 
2592 e 2 + 1344 v 7 ! £ r - 216 r 2 + 576 \/3 e - 288 r - 144] / [l6 (r + 2) (-3 e + V3) * (r + 1)] , 



■074,7(7-, e) = [-256 e 4 r 4 - 1536 e 4 r 3 + 256 v^eV - 2160 e 4 r 2 + 1664 V3s 3 t 3 - 288 eV - 2160 e 4 r + 2304 V3e 3 T 2 - 
2016 e 2 r 3 + 48v^er 4 - 1296 e 4 + 2592v^e 3 r-2664e 2 r 2 + 360v / 3£r 3 -9r 4 + 1728v / 3£ 3 -3456e 2 r + 432v / 3er 2 -72 r 3 

- 2592 e 2 + 672 VSe t - 72 r 2 + 576 V3 £ - 144 r - 144] / [l6 (-3 e + v^) 4 (r + l)r] , 

ra7 4 ,8(r,£) = 1, 

with the corresponding intervals X\ = [l, 1 — V3e), I2 = [l — \/3 £, V / 3/(2 £) — 3/2), I3 = [\/3/(2£) — 

vg^l-V3eh _ [v/3(l-v/3£) 



3)), J 7 = [2 (V3/£-3), 2 f F (1 - / g £) ),andX 8 = 



3/2 J V3/£-3) ) 2 4 =[v / 3/ £ -3,4(l-^£)),2 5 =[4 (l - V3 g ) , ^IX^ ^ = [ "^J^ , 2 (V3/e- 

/3 (l-3e 
4e-v / 3 



2 %/3 (l-3e) -, 



\jA (t, e) under the Association Alternatives 

Under association, we compute fig S (r, e) explicitly. For e £ [0, \/3/21 w .0825), Mcs( T > £ ) = 2j=i ro ij'( r i e ) I( T £ 
X,) where 

OTi.i (t, e) = [(-63936 rV+20736 v^rV- 145728 r 5 £ 4 -6912 r 6 e 2 +46848 v / 3t 5 £ 3 + 181872 rV+192 V3t 6 £-14976 r 5 e 2 - 
60480 \/3tV + 346896 r 3 e 4 + 36 r 6 + 256 V3t 5 e + 22464 tV - 107712 %/3r 3 £ 3 - 296640 r V + 128 r 5 - 1200 %/3r 4 £+ 
28512 r 3 e 2 +93696 v^r V-228528 r e 4 +63 r 4 +1056 VSt 3 £-27360 rV +74304 V3t £ 3 +81648 £ 4 -726 r 3 -208 V3t 2 e- 

25056 re 2 - 25920 v^e 3 + 445 r 2 + 768 VSt £ + 7776 £ 2 + 108 r - 108)r 2 j / [l8 (2 r + 1) (2 r - 1) (-6 £ + v^) ^ 

(6e + V3) 2 (r + 2) 2 (r-l) 2 ], 

07i, 2 (r, e) = [(-62784 rV + 19200 v^^e 3 - 199872 rV-4608 r V+56832 v / 3tV+34992 t 5 £ 4 -320 V3t 7 e-9216 rV- 

28224 \/3r 5 e 3 +476640 tV+164 r 7 -1600 VSt 6 £+22464 r'V-151104 VSr 4 e 3 +82368 r 3 e 4 +484 r 6 -1872 v^r 5 £+31392 tV+ 
6528 v / 3t 3 £ 3 -405216 r 2 e 4 -15 r 5 +3872 VSt 4 £-31392 rV+153792 \/3r 2 e 3 -252720 r £ 4 -1214 r 4 +3280 V3t 3 £-47520 r 2 £ 2 + 

67392 v^r £ 3 -46656 e 4 -13 r 3 -768 VSr 2 £-7776 r e 2 +324 r 2 + 108 r)rj / [l8 (r-l) 2 (2 r+1) 2 (r+2) 2 (-6 e + V^) 2 (6 e + V^) 2 ] , 

07!, 3 (t, e) = [(-62784 rV+19200 v^tV- 74304 rV-4608 rV + 18432 v^t 5 £ 3 +183600 rV-320 V3t 6 e-65088 v^r 4 £ 3 + 
179424 r 3 e 4 +164 r e -960 VSt 5 e+22464 rV-67584 \/3r 3 e 3 +3456 tV + 156 r 5 +48 V3t 4 £+21456 r 3 e 2 +1728 V / 3r 2 £ 3 + 
7776 re 4 - 327 r 4 - 112 V3t 3 e - 4320 r V + 10368 V3t e 3 + 11664 £ 4 - 74 r 3 - 384 V3t 2 e - 3888 r £ 2 + 135 r 2 + 54 r)r] / 

[l8 (r-l) 2 (2r + l) 2 (r + 2) (-6£ + V^) 2 (eE + v^) 2 ], 

07i, 4 (r, e) = [(-63072 rV + 19584 \/3r 4 e 3 - 267552 r 3 e 4 -5184 rV+81408 V / 3r 3 e 3 - 389304 rV-192 V3t 4 e-20160 r 3 e 2 + 
118176 v^tV -233712 r £ 4 +132 r 4 -896 V3t 3 e-29520 tV +71712 V3t £ 3 -48600 e 4 +488 r 3 -1072 V3t 2 £-18576 re 2 + 
15552 V3e 3 + 601 r 2 - 384 V3t e - 3888 £ 2 + 297 r + 54)r 2 ] / [l8 (2 r + l) 2 (r + 2) (r + 1) (-6 £ + V^) ^ £ + V^) 2 ] , 

07i,s(r, e) = [-49968 rV + 15936 v / 3r 5 £ 3 -219384 r 4 £ 4 -4896 r 5 £ 2 +64992 v^tV- 349920 r 3 £ 4 +32 v^r 5 £-18000 r 4 £ 2 + 
90720 v / 3r 3 £ 3 -270216 r 2 £ 4 +58 r 5 + 16 V3t 4 £-22032 rV+51840 V / 3r 2 £ 3 -112752 r £ 4 +191 r 4 -9072 r 2 £ 2 +10368 V3t e 3 - 

23328 £ 4 + 189 r 3 + 54 r 2 ] / [l8 (r + 2) (6 £ + VsJ ' (-6 £ + sft\ 2 (2 r + 1) (r + 1)] , 

07i, 6 (r, s) = [-50040 r 6 £ 4 + 16032 \/3r 6 £ 3 -221220 r 5 £ 4 -5040 r 6 £ 2 +66864 V&r s e 3 -368100 rV+64 V3t 6 £-19944 r 5 £ 2 + 
103728 V^tV -356616 r 3 £ 4 +50 r 6 +256 V3t 5 £-29880 tV +88992 v / 3r 3 £ 3 -307152 r 2 £ 4 + 179 r 5 +304 V3t 4 £-17712 r 3 £ 2 + 
46656 %/3-rV - 194400 r £ 4 + 185 r 4 + 96 VSt 3 e - 2592 r 2 e 2 + 10368 \/3r £ 3 - 46656 £ 4 + 54 r 3 ] j [l8 (r + 1) ^6 £ + sft} 2 

(-6e + \/3) 2 (2r + 1)(t + 2)r] , 

07i, r (r, s) = [3 (-1512 rV+480 \/3r 4 £ 3 -3780 r 3 £ 4 -144 r 4 £ 2 + 1200 V3t 3 s 3 +216 r 2 £ 4 -360 r 3 £ 2 +480 %/3r 2 £ 3 +4752 r £ 4 + 
2 r 4 - 288 r 2 £ 2 + 1728 £ 4 + 5 r 3 + 2 r 2 ) j j [2 ^6 £ + V^) 2 (-6 £ + Vtj ^ (2 r + 1) (r + 2) j , 



with the corresponding intervals T\ 



0, 



3 s/Se 



2 (1— </Ze 



3 V3e 2 y^e 

2 (l-V3e) ' 1-2 v^e 



2\/3e 3 \/3£ 
1-2 V3e' 1-V3t 



3 \/3 £ 3 V3s 
_1-V3e' 1-4v^Se 

For e e [^3/21,^3/12), 



3\/3e 6\/3e A T _ [ 6\Z3e 6V3e A ar ,j 7- _ [ 6 \/3e 1 
l-4v / 3e'T^I7j' ^ 6 ~ 1=737' 1-4 V3e J ailCL ^ 7 " Ll-4^' 1 



J=l 



where tU2.j(r, e) = 7Z7ij(t, e) for j = 1,2,3,4 with the corresponding intervals X\ 



0, 



3\/3j 



2 (l-%/3e) 



3 \/3e 



2y / 3e 



2 {l-sfiz) ' l-2 v / 3e 

For £ e [\/3/12,2 V3/15) 



2 \Z3e 3 %/3e 



l-2V3e' l-\/3( 



r- I , and Ta 



3 V3e 
1-V3e 



i=l 



where 



W3,i(t,£) 



(r 2 +5r + 9)r 2 



18(r + 2) 



2 ' 



ws,2(r, e) = - [(-216 tV + 288 v^r V - 1836 tV - 432 r 6 e 2 + 2160 V3t 5 s 3 - 6786 rV + 96 v^r 6 e - 2808 r 5 e 2 + 

- 5 + 1000 \/3r 4 e-6498 rV 



6600 v / 3r 4 e 3 -13401 rV-24 r 6 +528 VSt 5 e-6876 rV+9588 v / 3r 3 e 3 -16074 r V-108 r 5 + 1000 V3t 4 e-6498 r 3 e 2 H 
7500 v / 3r 2 e 3 - 10611 r e 4 - 154 r 4 + 484 v^r 3 e - 2178 t V + 2484 V3t e 3 - 2916 e 4 - 25 r 3 - 92 V3t 2 e + 162 r e 2 + 23 r 

- 36 V^Te + 9 t)t]/ [2 (2r- 1)(2 r + l) 2 (r + 2)° ' 

with the corresponding intervals X\ = 0, - ^ ^| 7 ^ , and Z2 = 2 ^ ^ , ■ 
For e e [2V3/15,\/3/3), we have Mc S ( r ' e ) = w 3 ,i(t,£)I(t e [0,1]). 



-3e + 



4)*]. 




Figure 41: The seven possible cases of Ncs(-, T )(%) for four distinct x £ Rcm^s) (shaded regions). 




Figure 42: The regions corresponding to the prototypes of the seven cases shown in Figure |4"T1 



